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REMARKS 

Attached hereto is a marked up version of the changes made to the specification by this 
amendment. The attachment is captioned "Version With Markings to Show Changes Made." 

Reconsideration and withdrawal of the rejections of this appUcation and consideration 
and entry of this paper are respectfully requested in view of the herein remarks and 
accompanying information, which place the application in condition for allowance. 

I. STATUS OF CLAIMS AND FORMAL MATTERS 

Claims 1-25 are pending. Claims 1-25 are amended without prejudice, without 
admission, without surrender of subject matter, and without any intention of creating any 
estoppel as to equivalents. 

No new matter is added. Support for the amended claims is found throughout the 
specification. 

It is submitted that these claims are patentably distinct from the references cited by the 
Examiner, and that these claims are in full compliance with the requirements of 35 U.S.C. §112. 
The amendments of the claims herein are not made for the purpose of patentability within the 
meaning of 35 U.S.C. §§ 101, 102, 103 or 1 12; but rather the amendments are made simply for 
clarification and to round out the scope of protection to which Applicants are entitled. 

II. RESPONSE TO RESTRICTION REQUIREMENT 

The March 30, 2001 Office Action required an election under 35 U.S.C. § 121 and 37 
CFR 1.141 from among: 

Group I. SEQ ID NOs 1 and 7; 

Group II. SEQ ID NOs 2 and 8; 

Group III. SEQ ID NOs 3 and 9; 

Group IV. SEQ ID NOs 4 and 1 0; 

Group V. SEQ ID NOs 5 and 1 1 ; and ' 

Group VI. SEQ ID NOs 6 and 12. 

Pursuant to the telephone conversation of February 20, 2001 with Examiner Souaya, the 
election of Group I, SEQ ID NOs 1 and 7, with traverse, is affirmed. Reconsideration and 
withdrawal of the restriction requirement are respectfully requested in view of the remarks 
herewith. 
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As a traverse, it is noted that the MPEP lists two criteria for a proper restriction 
requirement. First, the inventions must be independent or distinct. MPEP § 803. Second, 
searching the additional inventions must constitute an undue burden on the examiner if 
restriction is not required. Id, The MPEP directs the examiner to search and examine an entire 
application "[i]f the search and examination of an entire application can be made without serious 
burden, ...even though it includes claims to distinct or independent inventions." Id. 

The amino acid sequences (SEQ ID NOs 1-6) and the nucleic acids encoding them (SEQ 
ID NOs 7-12) are drawn functionally to the same enzyme, a glucan lyase. Furthermore, the 
invention is drawn to a process of producing an anti-oxidant in situ, in which process, preferably, 
an a-l,4-glucan lyase is used. The sequences 1-6 are given as six possible a-l,4-glucan lyases, 
as mere examples. Thus, each sequence does not constitute a separate invention, as the invention 
is the process of producing an anti-oxidant in situ from a glucan substrate. 

In addition, it is respectfully submitted that a search of all of the related nucleotide and 
amino acid sequences presented in this application also does not pose a serious burden to the 
examiner. For example, as shown by the enclosed comparison, the peptides represented by SEQ 
ID NOs 2, 5 and 6 and the corresponding nucleic acids of SEQ ID NOs 8, 1 1 and 12 share 
greater than 74% sequence homology to SEQ ID NOs 1 and 7 respectively, underscoring the 
interrelated nature of these molecules. It would be unduly burdensome on the Applicants to have 
to file divisional applications to each amino acid sequence and corresponding nucleotide 
sequence, particularly as a "broad" process claim is sought. 

Therefore, Groups I, II, III, IV, V and VI can be searched and examined in this 
application as there is no undue or serious burden in searching and examining these claims 
together. Thus, the restriction requirement is improper; and, it is respectfully requested that the 
restriction requirement be reconsidered and withdrawn. 

III. THE REJECTION UNDER SlOl IS OVERCOME 

Claims 19-25 stand rejected under 35 U.S.C. §101 as allegedly not reciting any steps 
involved in the process. The rejection is respectfully traversed. The amendment herein has 
placed these claims in condition for allowance, rendering the rejection moot. Simply, the "use" 
recitation is no longer in the claims. Consequently, reconsideration and withdrawal of the 
rejection is respectfully requested. 
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IV. THE REJECTION UNDER SI 12. FIRST PARAGRAPH, IS OVERCOME 

Claims 1-18 stand rejected under 35 U.S.C. §112, first paragraph, as allegedly containing 
non-enabling subject matter. The Office Action contends that the application fails to teach the 
method of the claimed invention with regard to any recombinant enzyme expressed in any plant 
to produce any anti-oxidant from a glucan substrate. Applicants disagree. The instant invention 
is clearly enabled because a skilled artisan would readily understand how to make and use the 
invention. Further, possession of the invention at the time of filing did exist. 

According to the Court of Appeals for the Federal Circuit in the case of In re Wands, 8 



Enablement is not precluded by the necessity for some 
experimentation such as routine screening. However, 
experimentation needed to practice the invention must not be undue 
experimentation. The key word is undue, not experimentation.' The 
determination of what constitutes undue experimentation in a given 
case requires the application of standard of reasonableness, having 
due regard for the nature of the invention and the state of the art. The 
test is not merely quantitative, since a considerable amount of 
experimentation is permissible, if it is merely routine, or if the 
specification in question provides a reasonable amount of guidance 
with respect to the direction in which the experimentation should 
proceed ... [Citations omitted]. 



Against this background, determining whether undue experimentation is required to 
practice a claimed invention turns on weighing many factors summarized in In re Wands, 858 
F.2d 731, 8 USPQ2d 1400 (Fed. Cir. 1988) For example, (1) the quantity of experimentation 
necessary; (2) the amount of direction or guidance presented; (3) the presence or absence of 
working examples of the invention; (4) the nature of the invention; (5) the state of the prior art; 
(6) the relative skill of those in the art; (7) the predictability or unpredictability of the art; and (8) 
the breadth of the claims. 

Thus, the assertion in the Office Action that the instant invention does not provide 
enablement for expressing any recombinant enzyme in any plant to produce any anti-oxidant 
from a glucan substrate is misplaced because undue experimentation would not exist. Applying 
Wands to the instant facts, it is clear that enablement exists, to wit, inter alia, that the quantity of 
experimentation necessary is low; the amount of direction or guidance presented is high; 



U.S.P.Q. 2d 



1400 (Fed. Cir. 1988), 



Id, at 1404. 
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working examples are clearly present; the relative skill of those in the art is high; and the 
predictability of the art is also high. 

A skilled artisan would readily understand that a variety of recombinant enzymes which 
act on a glucan substrate, such as starch, to produce an anti-oxidant, such as anhydrofructose, are 
known in the art. For example, the enzyme amyloglucoside can act on a glucan substrate to 
V produce glucose, which is then further converted to the anti-oxidant ascorbic acid, for example 
^ . ^ by glucose oxidase. Thus, it would be unduly restrictive to limit the invention to either a specific 

^ enzyme and/or to a specific anti-oxidant. 

^^"^ 

^ V In addition, several plants, namely grape, potatoes, maize and guar, are, in fact, 

^s* exemplified (see pages 23-3 1 of the published PCT specification). It is hereby asserted that the 
present invention does disclose the invention in a manner which is clear and concise and which 
would enable a skilled person to carry out the invention. 

Thus, it is respectfully submitted that the assertion in the Office Action that the claims are 
not enabled is obviated. Consequently, the Section 1 12, first paragraph, rejection should be 
reconsidered and withdrawn; and, such relief is respectfully requested. 

V. THE REJECTION UNDER §112. SECOND PARAGRAPH, IS OVERCOME 

Claims 1 and 17-25 stand rejected under 35 U.S.C. §112, second paragraph, as allegedly 
being indefinite. The amendments submitted herewith place the claims in condition for 



r allowance and have rendered the Section 1 12, second paragraph, rejection moot. Simply, the 
bases for the rejection no longer appear in the claims. Consequently, reconsideration and 
withdrawal of the rejection is believed to be in order and such action is respectfully requested. 

VL THE REJECTION UNDER S103 IS OVERCOME 

Claims 1-18 stand rejected under 35 U.S.C. 103(a) as allegedly being unpatentable over 
Yu(a) or Yu(b) in view of Poulsen and Ishida et al. and Perl et al. The cited documents fail to 
disclose, suggest, or motivate a skilled artisan to practice, the presently claimed invention. The 
rejection is respectfully traversed. 

The present invention relates to advantageous procedures based on the recombinant 
expression of glucan-metabolising enzymes in plants to produce anti-oxidants in situ. 

One aspect of the present invention is based on a strategy of producing, in a medium, 
anti-oxidants which are metabolites of glucan by expressing within a plant, or part thereof, a 
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recombinant enzyme which acts on a giucan substrate present within the medium (including the 
plant or part thereof) to yield the anti-oxidant. This is in direct contrast to previously known 
procedures for producing giucan metabolite anti-oxidants which either rely on the addition of 
purified enzymes to a glucan-containing medium or use microorganisms expressing the enzymes. 

In another aspect of the present invention, anti-oxidants are produced in situ in plants 
transformed with enzymes that convert glucans to anti-oxidant metabolites. This may be used, 
for example, to improve the stress tolerance of the plants or to improve the efficiency of 
transformation with foreign nucleic acid constructs. 

Claim 1 and dependent claims involve a medium comprising a plant or part thereof. 
Thus, the amended claims are novel over the prior art disclosures. Although Yu(a) and Yu(b) 
may relate to the expression of a-1 ,4-glucan lyase in a host organism, such that it is capable of 
producing 1,5-D-anhydrofructose (AF), there is no mention of that organism being a plant. 
Rather, Yu(a) and (b) refer specifically to microorganisms such as bacteria, yeast and fungi. 
Indeed, Yu (a) and (b) are essentially concerned with the expression of a-l,4-glucan lyase and its 
subsequent use in the large-scale production of AF. 

The skilled person concerned with obtaining a-l,4-glucan lyase in large quantities, for 
use in the large-scale production of AF and compounds such as the antibiotic microthecin, would 
not use plants for such a purpose; and, the transformation of microorganisms fails to teach or 
suggest transformation of plants or parts thereof Further, Yu(a) and (b) are not concemed with 
improving the properties of plants or plant materials by in situ expression of a-l,4-glucan lyase. 
Thus, there is no incentive, teaching, or suggestion in Yu(a) and (b) to express a-l,4-glucan 
lyase in plants or parts thereof to produce antioxidants in situ from a giucan substrate. 

Further, it is asserted by the Office Action that Poulsen, Ishida et al and Perl et al 
merely teach the transformation of potatoes, maize and grapes, respectively. While the 
transformation of plants was within the art, the Office Action has not shown a document which ^ * 
motivates the skilled person to even attempt to express an enzyme, for example a-1 ,4-glucan j^s-U. qj^ 
lyase, in a plant to produce in situ an anti-oxidant, for example AF, from a giucan substrate. 

The state of the law in the Federal Circuit requires that the suggestion or motivation to 
practice the claimed invention must be present in the cited art, and not gleaned from the 
Applicant's specification. Further, picking and choosing portions from a plethora of disparate 
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references in a hindsight attempt to formulate an obviousness rejection is prohibited. In the 

instant case, the only suggestion or motivation to provide for the recombinant expression of [\|o ^ 

glucan-metabolizing enzymes in plants to produce anti-oxidants in situ is found in the " u 

Applicant's disclosure. Thus, the cited documents fail to teach or suggest the instant invention. '^^^^ 

A prima facie case of obviousness has not been made and the rejections of the claims have been ^^^j^^^j 

obviated. 

In addition, the arguments in the Office Action w^ith regard to obviousness, particularly 
w^ith regard to the ease of plant transformation, apparently run counter to the Section 112 ^ 
rejection; and, it is further respectfully asserted that the Examiner cannot have it both ways. 

It is submitted therefore that none of the cited documents, either alone or in combination 
would lead the skilled person to express, in a plant, a recombinant enzyme that acts on a glucan 
substrate to produce an anti-oxidant. Thus, Claims 1-18 are novel and inventive (non-obvious) 
over the cited art. Accordingly, reconsideration and withdrawal of the Section 103 rejection is 
believed to be in order and such action is respectfully requested. 

CONCLUSION 

In view of the remarks and amendments herewith, the application is believed to be in 
condition for allowance. Favorable reconsideration of the application and prompt issuance of a 
Notice of Allowance are earnestly solicited. The undersigned looks forward to hearing 
favorably from the Examiner at an early date. 



Respectfully submitted. 



FROMMER LAWRENCE & HAUG LLP> 
Attpmeys for Applicait 




Thomas J. Kowalski 
Reg. No. 32,147 
Tel: (212) 588-0800 
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Version with Markings to Show Changes Made 



IN THE CLAIMS 

1 . A process for producing an anti-oxidant in a medium [comprising a component] which 
medium comprises fis] a plant or part thereof, and which process [method] comprises expressing 
in the plant [component] or part thereof a recombinant enzyme which acts on a glucan substrate 
present in the medium and/or the component or part thereof, to yield said anti-oxidant. 

2. The [A] process according to claim 1^ wherein the glucan comprises a- 1,4 links. 

3. The [A] process according to claim 2^ wherein the glucan is starch. 

4. The [A] process according to claim 1^ wherein the enzyme is a glucan lyase. 

5. The [ A] process according to claim 4, wherein the enzyme is an a-l,4-glucan lyase. 

6. The [A] process according to claim 5, wherein the enzyme is any one of the sequences shown 
as SEQ ID NOs[Nos] 1-6, or a variant, homologue or fragment thereof 

7. The [ A] process according to claim 6, wherein the enzyme is any one of the sequences shown 
as SEQ IDNOs[Nos] 1-6. 

8. The [A] process according to claim 4^ wherein the enzyme is encoded by a nucleotide 
sequence comprising any one of the sequences shown as SEQ ID NOs[Nos] 7-12, or a variant, 
homolog[ue] or fragment thereof. 

9. The [A] process according to claim 8, wherein the enzyme is encoded by a nucleotide 
sequence having any one of the sequences shown as SEQ ID NOs [Nos] 7-12. 

10. The[A] process according to claim 1^ wherein the anti-oxidant is produced in the plant 
component, or part thereof, and is then released into the medium. 

1 1 . The [A] process according to claim 1^ wherein the anti-oxidant is anhydrofructose. 
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12. The [A] process according to claim 11, wherein the anti-oxidant isl,5-D-anhydrofrutose. 

13. The [A] process according to claim 1^ wherein the medium, is, or is used in the preparation 
of, a foodstuff. 

14. The[A] process according to claim 13, wherein the foodstuff is a beverage. 

15. The [A] process according to claim 14, wherein the beverage is an alcoholic beverage. 

16. The [A] process according to claim 14, wherein the beverage is a wine. 

17. The [A] process according to claim i[13], wherein the plant [component] is all or part of a 
grape. 

18. The [A] process according to claim 17, wherein the plant fcomponent] is all or part of a grape. 

19. [Use of anhydrofructose as] Method of imparting into a medium an anti-oxidant comprising 
the steps of adding into the medium [for a medium comprising] at least one component which is a 
plant or part thereof^ wherein the anti-oxidant is anhydrofructose, and wherein the 
anhydrofructose is prepared in situ in the medium. 

20. [Use of anhydrofructose as a means for] Method of imparting or improving stress tolerance 
in a plant, comprising administering anhydrofructose, wherein the anhydrofructose is prepared in 
situ in the plant. 

2 1 . [Use of anhydrofructose as a means for] Method of imparting or improving the 
transformation of a grape, comprising administering anhydrofructose, wherein the 
anhydrofructose is prepared in situ in the grape. 

22. [Use of glucan lyase as a means for] Method of imparting or im proving stress jolerance in a 
plant, comprising administering glucan lyase, wherein the glucan lyase is prepared in situ in the 
plant. 
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23. [Use of glucan lyase as a means forJ Method of imparting or improving the transformation of 
a grape, comprising administering glucan lyase, wherein the glucan lyase is prepared in situ in 
the grape. 

24. [Use of a nucleotide sequence coding for a glucan lyase as a means for] Method of imparting 
or improving stress tolerance in a plant, comprising adminstering a nucleotide sequence coding 
for a glucan lyase, w^herein the nucleotide sequence is expressed in situ in the plant. 

25. [Use of a nucleotide sequence coding for a glucan lyase as a means forl Method of imparting 
or improving the transformation of a grape, comprising administering a nucleotide sequence 
coding for a glucan lyase, wherein the nucleotide sequence is expressed in situ in the grape. 
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^ADE-^t^^^^'™^^"^^** glucan lyase aminoacid sequences of Sequence ID No. 1 against No.2-6 
Part 1. Summary of the homolodiges of Squence ID No. 1 gainst No.2-6 



Seq. ID. No. 


1 


2 


3 


4 


5 


6 


Homology 
(%) 


100 


77.1 


26.8 


26.8 


74.6 


77.9 



Part 2. Pairwise comparisons of Seq No. 1 against No. 2-6 

1. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 2 (SIN2) 

Sequence 1: SINl, (1088 residues) 
Sequence 2: SIN2, (1091 residues) 

using the parameters: 
Comparison matrix: BLOSUM62 
Number of alignments computed: 20 
Gap open penalty: 12 
Gap extension penalty: 4 

77.1% Identity In 1096 residues overlap; Score: 4534.0; Gap frec[uency: 1.2% 



RECEIVED 

JUL 2 7 2001 
TECH CENTER 1600/2900 



SINl, 
SIN2, 



1 MFSTLAFVAPSALGASTFVGAEV-RSNVRIHSAFPAVHTATRKTNRLNVSMTALSDKQTA 
1 MYPTLTFVAPSALGARTFTCVGIFRSHILIHSVVPAVRLAVRKSNRLNVSMSALFDKPTA 
* ** ********* ** ** *** *** * *+ ******* ** ** ** 



SINl, 
SIN2, 



60 TAGSTDNPDGIDYKTYDYVGVWGFSPLSNTNWFAAGSSTPGGITDWTATMNVNFDRIDNP 

61 VTGGKDNPDNINYTTYDYVPVWRFDPLSNTNWFAAGSSTPGDIDDWTATMNVNFDRIDNP 

* **** * * ★* * **************** *'■** *^ ************ * 



SINl, 
SIN2, 



120 SITVQHPVQVQVTSYNNNSYRVRFNPDGPIRDVTRGPILKQQLDWIRTQELSEGCDPGMT 

121 SFTLEKPVQVQVTSYKNNCFRVRFNPDGPIRDVDRGPILQQQLNWIRKQEQSKGFDPKMG 
* * ********* ** ************* ***** *** *** ** * * ** * 



SINl, 
SIN2, 



180 FTSEGFLTFETKDLSVI I YGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLY 

181 FTKEGFLKFETKDLNVI I YGNFKTRVTRKRDGKGIMENNEVPAGSLGNKCRGLMFVDRL Y 
** **** ****** ************** *** **** ** * *"*' *'* ********** 



SINl, 
SIN2, 



240 GNAIASVNKNFRNDAVKQEGFYGAGEVNCKYQDT YILERTGI-AMTNYNYDNLNY 

241 GTAIASVNENYRNDPDRKEGFYGAGEVNCEFWDSEQNRNKYILERTGIAMTNYNYDNYNY 
* ****** * *** *********** * ***************** ** 



SINl, 
SIN2, 



294 NQWDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTS-EQYSYGWFMDNVSQSYMNTGD 
301 NQSDLIAPGYPS — DPNFYI PMYFAAPWVVVKGCSGNSDEQYSYGWFMDNVSQTYMNTGG 
** ** * * ***** **** * ** * * ************** ***** 



SINl, 
SIN2, 



353 TTWNSGQEDLAYMGAQYGPFDQHFVYGAGGGMECVVTAFSLLQGKEFENQVLNKRSVMPP 
359 TSWNCGEENLAYMGAQCGPFDQHFVYGDGDGLEDVVQAFSLLQGKEFENQVLNKRAVMPP 
* ** * * ******* ********** * * * ** ****************** **** 



SINl, 
SIN2, 



413 KYVFGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVF 
419 KYVFGYFQGVFGIASLLREQRPEGGNNISVQEIVEGYQSNNFPLEGLAVDVDMQQDLRVF 
***** ****** **** * * ***** ******* **** ********** **** 



SINl, 
SIN2, 



4 73 TTKGEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQDYEVNQTLRER 

479 TTKIEFWTANKVGTGGDSNNKSVFEWAHDKGLVCQTNVTCFLRNDNGGADYEVNQTLREK 
*** ****** ****** ** **************** ******** * ********** 



SINl, 
SIN2, 



533 QLYTKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVAEWWGNNYKKL 

539 GLYTKNDSLTNTNFGTTNDGPSDAYIGHLDYGGGGNCDALFPDWGRPGVAEWWGDNYSKL 
********* * ** * **************** *********** ****** ** ** 



SINl, 
SIN2, 



593 FSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGNWPNADDPSNGQYNWKTYHPQVLVTDMR 
599 FKIGLDFVWQDMTVPAMMPHKVGDAVDTRSPYGWPNENDPSNGRYNWKSYHPQVLVTDMR 
* ******************* ** *** ***** **** *********** 



SINl, 
SIN2, 



653 YENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISRGGYIGNQHFGGMW 
659 YENHGREPMFTQRNMHAYTLCESTRKEGIVANADTLTKFRRSYIISRGGYIGNQHFGGMW 



SINl , 713 VGDNSTTSNYIQMMI ANNINMNMSCLPLVGSDIGGFTSYDNENQRTPCTGDLMVRYVQAG 
SIN2 , 719 VGDNSSSQRYLQMMIANIVNMNMSCLPLVGSDIGGFTSYDGRNV CPGDLMVRFVQAG 

SINl, 773 CLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQEVLYTAMYQNAAF 

S I N2 , 7 7 6 CLLPWFRNHYGRLVEGKQEGKYYQELYMYKDEMATLRKFI EFRYRWQEVL YTAMYQNAAF 

********** * * * ** **■ + *^ ******** * 

SINl, 833 GKPIIKAASMYNNDSNVRRAQNDHFLLGGHDGYRILCAPVVWENSTERELYLPVLTQWYK 

SIN2, 836 GKPIIKAASMYDNDRNVRGAQDDHFLLGGHDGYRILCAPVVWENTTSRDLYLPVLTKWYK 

★ ********** *■* *** ** **** ***************** * * * ******* *** 

SINl , 8 93 FGPDFDTKPLEGAMNGGDRI YNYPVPQSESPIFVREGAILPTRYTLNGENKSLNTYTDED 

SIN2, 8 96 FGPDYDTKRLDSALDGGQMIKNYSVPQSDSPIFVREGAILPTRYTLDGSNKSMNTYTDKD 

**** *** * * ** * ** **** ***************** * *** ***** * 

SINl, 953 PLVFEVFPLGNNF^DGMCYLDDGGVTTNAEDNGKFSVVKVAAEQDGGTETITFTNDCYEY 

SIN2, 95 6 PLVFEVFPLGNNRADGMCYLDDGGITTDAEDHGKFSVINVEALRKGVTTTIKFAYDTYQY 

************************ ** *** ***** * * * * ** * * * * 

SINl, 1013 VFGGPFYVRVRGAQSPSNIHVSSGAGSQDMKVSSATSRAALFNDGENGDFWVDQETDSLW 

S IN2 , 1016 VFDGPFYVRIRNLTTASKINVSSGAGEEDMTPTS ANSRAALFSDGGVGEYWADNDTSSLW 

****** * * * ****** ** ** ****** ** * * * * *** 

SINl, 1073 LKLPNVVLPDAVITIT 

SIN2, 1076 MKLPNLVLQDAVITIT - - - 

**** ** ******* 

2. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 3 (SINS) 

Identity : 286 (26.83%) 

Similarity: 166 (15.57%) 

Number of gaps inserted in GL1FULL: 59 

Number of gaps inserted in MCSYl; 67 

GL1FULL (=SIN1). Total number of residues: 1088. 

MCSYl. (-SIN3). Total number of residues: 1066. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 2 
Unit gap cost : 2 

The character to show that two aligned residues are identical is '|' 
The character to show that two aligned residues are similar is '.' 
Amino acids said to be 'similar' are: A.S.T; D,E; N,Q; R,K; l,L,M,V; F,Y,W 



GL1FULL - M— FS-TL7\FVAPSALGASTFVGAEVRSNVRIHSAFPAVHTATRKTNRLN -47 

MCSYl - MAGFSDPLNFCKAEDYYS— -VALDWKGPQKI— l-GVDTTPPKSTKFP -43 

GL1FULL - VSMTALSDKQTATAGSTDNPDGIDY-KTYDYVGVWG— FSP-LSNTNWFA -93 

MCSYl - KNWHGVNLRF-— DDGT— LGWQFIRPC VWRVRYDPGFKTSDEYG -84 

GL1FULL - AGSSTPGGITDWTATMNVNFDRIDN-PSITVQHPVQVQVTSYNNNS-YRV -141 

MCSYl - DENTRTI-VQDYMSTLS-— NKLDTYRGLTWE TKCEDSGOF-F -122 

GL1FULL - RFNPDGPIRDVTRGPILKQQL-DWIRTQELSEGCDPGMTFTSEGFLTFET -190 

I . |. .. .|.| I II 

MCSYl - TFS— SKVTAVEKSERTRNKVGDGLRIH-LWKS P-FRIQWRTLTP -154 

GL1FULL - -KDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLY -239 



'J 



MCSY1 - LKDPYPI— PNVAAAEARVSD-KV VWQTSP— K Tr--R 195 

GLIFULL - GNAIASVNKNFRN-— DAVK—QEGFYGAGEVNCKYQDTYILERTGIAMT -284 

- I . I II ■ I II. I ■■ I II 
MCSY1 - -KNLHPQHKMLKDTVLDIVKPGHGEYVGWGEMGG-IQ— FMKEPT-F-MN -239 

GL1FULL - NYNYDNLNYNOWDLRPPHHDGALNP-DYYIPMYYAAPWLI-VNGCAGTSE -332 

.|.||..|.| . Ill . |,| . j. . II I 
MCSY1 - YFNFDNMQYQQ— VYA— QGALOSRE— PLYHSDPFYLDVNS— NPE -278 

GLIFULL - OYSYGWFMDNVSQSYMNT-GDTTWNSGQEDLAY— MGAQYGPFDQHFVYG -379 

I II .|. II I . 

MCSY1 - H KNITATFIDNYSQIAIDFGKTNSGYIKLGTRYGGID-CYGIS -320 

GLIFULL - AGGGMECWTAFSLLQGKEFENQVLNKRSVMPPKYVFGFFQGVFGTSSLL -429 

MCSYl - ADTVPE-IVRLYTGLVGR— SKL— K PRYILGAHQACYGYQQ 357 

GLIFULL - RAHMPAGENNI-SVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVFnKGEF -478 

I ■ II I I II \- IIINI I III 
MCSY1 ESDLYSV— VQQYRDCKFPLDGIHVDVDVQDGFRTFTTN 394 

GLIFULL - WTANRVGTGGDPNNRSVFE-WAHDKGLVCQTNITCFLRNON-EGQDY 523 

II ■ -I M nil - I II I 

MCSYl PHTF PNPKEMFTNL-RNNGIKCSTNITPVISINNREG-GYSTL -435 

GLIFULL - -E-VNQT— LRERQLYTKNDSLTGTD FG— MTD-DGPSD -555 

II ..nil -I III 
MCSYl - LEGVDKKYFIMD-DRYTEGTSGNAKDVRYMYYGGGNKVEVDPNDVNGRPD -484 

GL1FULL AYIGHLD-— YG— GGV ECDALFPDWGRPDVAEWWGNNY -589 

I III .11 I .1 III -1 

MCSYl - FKDNYDFPANFNSKQYPYHGGVSYGYGNGSAGFYPDLNRKEVRIWWGMQY -534 

GLIFULL - KKLFSIGLDFVWQDMTVPAMMPHK-IGDDINVKPDGNWPN— -ADDPSNG -635 

I II .11.1111 . I II II I 
MCSYl - KYLFDMGLEFVWQDMT— TPAIHTSYGD MKG-LPTRLLV--TSDS -574 

GL1FULL - QYNWKTYHPQVLVTDMRYENHGREPM\/TQRNIHAYTLCESTRKEGI--VE -683 

MCSYl VTNA— S-EKKL7\IETWALYSYNLHKATWH-GLSRLE -607 

GLIFULL - NADTLTKFRRSYIISRGGYIGNQHFGGMWVGDNSnSNYIQMMIANNINM -733 

M-j. INI IN III 

MCSYl - S RKNKRNFILGRGSYAGAYRFAGLWTGDNASNWEFWKISVSQVLSL -653 

GLIFULL - NMSCLPLVGSDIGGFTSYDNEN--QRTPCTGDLMVRYVQAGCLLPWFRNH -781 

. - . Ill III I I |. .|..|. nil III 
MCSYl - GLNGVCIAGSDTGGFEPYRDANGVEEKYCSPELLIRWYTGSFLLPWLRNH -703 

GLIFULL - YDRWIESKDHGKDYQELYMYPN— E— MD TL RKFVEF -814 

I . II I .11 I II I ■ I .11 

MCSY1 ■- YVK KDR-KWFQEPYSYPKHLETHPELADQAWLYKSVLEICRYYVEL -748 

GLIFULL - RYRWQEVLYTAMYQNAAFGKPIIKAASMYNN-DSNVRRAQNDHFL L -859 

II .11 l-ll I II .. . |. II . 

MCSYl - RYSLIQLLYDCMFQNWDGMPITRSMLLTOTEDTTFF-NESQKFLDNQYM -797 



GL1FULL - GGHDGYRILCAPWWENS-TE RELYLPVLTQWY— KFGP-DFDT- -899 

II III!.- I I IJ. II III 
MCSY1 - AGDD— ILVAPIL— HSRKEIPGENRDWLPLYHTWYPSNLRPWD-DQG -841 

GL1FULL KPLEGAM— NGGDRIY— NYPVPQSESPIFVREGAILPTRYTLNG -940 

Nl I II I - I IJIIN . 
MCSY1 - VALGNPVEGGSVINYTARIVAPEDYNLFHSWPVYVREGAIIPQ— -I 885 

GL1FULL - ENKSLNTYTDEDPLVFEVFPLGNNRADGMCYLODGGVnNA— EDNGKFS -988 

I . ■ I .-I I . mil I. ■ II 

MCSY1 - EVRQWTGQGGANRIKFNIYP-GKDKEYCT-YLDDG-VSRDSAPED-— LP -930 

GL1FULL - WKVAAEQD— GGTETIT— F-TNDCYEYVFG-GP FYVRVRGAQSP -1028 

I . II |.| I. I . I I . -I .|. 

MCSY1 - QYKETHEQSKVEGAE-IAKQIGKKTGYN-ISGTDPEAKGYHRKVAVTQTS -978 

GLIFULL - SNIHVSSGAGSQDMKVSS-ATSRAA L FNDG 1057 

MCSY1 - KD-— KTRTVTIEPKHNGYDPSKEVGDYYTIILWYAPGFDGSIVDVSKTT -1025 

GLIFULL - ENGDFWVDQE TD— SLWLKLPNWL— PDAV-ITIT -1088 

I . |. .1 .... |. .1 II I 
MCSY1 - VNVEGGVEHQVYKNSDLHTWIDV-KEVIGnKSVKITCTAA -1066 



3. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 4 (SIN4) 

Identity : 287 (26.82%) 

Similarity: 160 (14.95%) 

Number of gaps inserted in GLIFULL: 64 

Nunnber of gaps inserted in MVSY1: 63 

♦ ALIGNMENT OF TWO PROTEIN SEQUENCES. * 

The two sequences to be aligned ore: 

GLIFULL (=SIN1). Totol number of residues: 1088. 
MVSY1 (-SIN4). Total number of residues: 1070. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 2 
Unit gap cost : 2 

The ctiorocter to stiow that two aligned residues ore identicol is '|' 
The character to show that two aligned residues are similar is '.' 
Amino acids said to be 'similar' are: A,S,T; D,E; N,Q; R,K; l,L,M,V; F,Y,W 



GLIFULL - MFSTLAFVAPSALGAS-TFVGAEVRSNVRIHSAFPAVH— TATRKTNRLN -47 

I .1 I II . I . I I 

MVSY1 - M — AGLSDPLNFCKAE— DY YAAAKGWSGPQKIIRYD -33 

GLIFULL - VSMTALSDKQTATAGSTDN-P— DGIDYKTYDYVG— VWG— FSP-LSNT -89 

• I . . I I II .1 II . I . . 
MVSY1 - QTP-PQGTKD-PKSWHAVNLPFDDGTMCW-QFVRPCVWRVRYOPSVKTS -80 

GLIFULL - NWFAAGSSTPGGITDWTATMNVNFDRIDNPSITVQHPVQVQVTSYNNNS- -138 

. . . |. .[. I I .1 |.. 
MVSY1 - DEYGDENTRTI-VQDYMnLVGNLDIF— RGLTW VSTLEOSGE -120 



GL1FULL - YRVRFNPDGPIRDVTRGPILKQQLDWIRTQELSEGCDPGMTFTSEGFL 186 

11-11 - II \l ] 
MVSYl - Y-YTFKSEVTAVDET E— RTR— NKVGD-GLKI YLWK -151 

GL1FULL - -TFETKDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFV -235 

I . I . I I I II.. I 

MVSYl - NPFRIQWRLL TPLV— DPFPIP-N— VANATA R— VA -182 

GLIFULL - DRLYGNAIA-SVNKNFR— NDAVKQEGF— YGAGEVNCKYQDTYIL— ER -278 

I II .|. I |. I 

MVSYl - DKWWQTSPKTFRKNLHPQHKMLKDTVLDIIKPG HGEYVGWGEM -226 

GLIFULL - TGIA MT— NY-NYDNLNYNQWDLRPPHHDGALNP-DYYIPMYYAAP -320 

II I II l-l|..|.| . - III . |.| . I 
MVSYl - GGIEFMKEPTFMNYFNFDNMQYQQ— VYA— QGALDSRE— PLYHSDP -268 

GLIFULL - WLI-VNGCAGTSEQYSYGWFMDNVSOSYMNT-GDnWNSGQEDLAY— MG -366 

• II I I II .1 

MVSYl - FYLDVNS— -NPEH KNITATFIDNYSQIAIDFGKTNSGYIKLG -308 

GLIFULL - AQYGPFDQHFVYGAGGGMECWTAFSLLQGKEFENQVLNKRSVMPPKYVF -416 

. II I . I I .1 .. I I- .1 I.|. 
MVSYl - TRYGGID-CYGISADTVPE-IVRLYTGLVGR— SKL— K— — PRYIL -346 

GLIFULL - GFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVDVDMQ -466 

I M I . .1 I II -W nil I 

MVSYl - GAHQACYGYQQ ESDLHA— WQQYRDTKFPLDGLHVDVDFQ -385 

GLIFULL - DNLRVFTTKG-EFWTANRVGTGGDPNNRSVFEWAHDKGLVCOTNITCFL- -514 

II I III II - ■ M III! . 

MVSYl - DNFRTFTTNPITF— PNPK EMFT-NL— — RNNGIKCSTNITPVIS -424 

GLIFULL - -RIM— D NEGQD— Y EVN QTLR ERQL-Y -535 

I III I I I I .1 

MVSYl - IRDRPNGYSTLNEGYDKKYFIMDDRYTEGTSGDPQNVRYSFYGGGNPVEV -474 

GLIFULL - TKNDSLTGTDFG MTD DGPSDAYIGHLDYGGGVECDALFPDW -576 

II . Ill I II . II I -W 

MVSYl - NPNDVWARPDFGDNYDFPTNFNCKDYPYHGGVSY-GYGNGTP— GYYPDL -521 

GLIFULL - GRPDVAEWWGNNYKKLFSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGNW -626 

I .1 III ] II illlllll II I 
MVSYl - NREEVRIWWGLQYEYLFNMGLEFVWQDMHPA IH SSY -558 

GLIFULL - PNADDPSNGQYNWKTTHPQVLVTDMRYENHG-REPMVTQRNIHAYTLCES -675 

I I .III I . . . .11 . 
MVSYl GDM-KG LPTRLLVTADSVTNASEKKU\IESWALYSYNLHKA -598 

GLIFULL - TRKE-GIVENADTLTKFRRSYIISRGGYIGNQHFGGMWVGDNSTTSNYIQ -724 

I I .1 I .1 -Ml I I I 1-1 III..I - 
MVSYl - TFHGLGRLES RKNKRNFILGRGSYAGAYRFAGLWTGDNASTWEFWK -644 

GL1FULL - MMIANNINMNMSCLPLVGSDIGGFTSYDNE-NQRTPCTGDLMVRYVQAGC -773 

mill I I. -U. 

MVSYl - ISVSQVLSLGLNGVCIAGSDTGGFEPARTEIGEEKYCSPELLIRWYTGSF -694 
GLIFULL - LLPWFRNHYDRWiESKDHGKDYQELYMYPNEMOT— L R 809 

llll nil . II I .11 I II ..| I 



WVSY1 - LLPWLRNHYVK KDR-KWFQEPYAYPKHLETHPELADQAWLYKSVLE -739 

GLIFULL KF-VEFRYRWQEVLYTAMYQNAA FGK-PIIKAA— SMYNNDSNV -849 

.. II II .11 l-ll I.|. 

MVSY1 - ICRYWVELRYSLIQLLYDCMFQNWDGMPLARSMLLTDTEDTTFFNESQ- -788 

GLIFULL - RRAQNDHFLLGGHDGYRILCAPWWENS-TE RELYLPVLTQWY 891 

■ I I II II- I I II 
MVSY1 - -KFLDNQYM-AGDD— ILVAPIL— HSRNEVPGENROVYLPLFHTWYPS -831 

GL1FULL - KFGP-DFDT KPLEGAM— NGGDRIY— NYPVPQSESPIFVREGA -930 

III l\\ I II |. Nlll 
MVSYl - NLRPWD-DQGVALGNPVEGGSVINYTARIVAPEDYNLFHNWPVYIREGA -880 

GLIFULL - ILPT-— RYTLNGENKSLNTYTDEDPLVFEVFPLGNNRAOGMCYLDOGGV -977 

MVSYl - IIPQIQVR-QWIGE— G GPNPIKFNIYP-GKDKEYVT-YLDDG-V -919 

GLIFULL - TTNAEDNGKFSWKVAAEQD— GGTET— ITF TNDCYEYVFGGP -1017 

MVSYl - SRDSAPDD-LPQYREAYEQAKVEGKDVQKQL^VIQGNKTND FSAS -963 

GLIFULL - FYVR-VRGAQSPSNI-HVS— SGAGSQDMKVSS-ATSRAA LF -1054 

MVSYl - GIDKEAKGYHRKVSIKQESKDKTRWTIEPKHNGYDPSKEVGNYYTIILW -1013 

GLIFULL - NDGE-NGDFWVD— QETDSL— WLK LPNWLPDA— Vl-T 1086 

I III I . . I II. II I 

MVSYl - YAPGFDGSI-VDVSQATVNIEGGVECEIFKNTGLHTVW-NVKEVIGTTK -1061 

GLIFULL IT -1088 

MVSYl - SVKITCHA -1070 



4. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 5 (SINS) 

Sequence 1: SINl, (1088 residues) 
Sequence 2: SINS, (1092 residues) 

using the parameters : 
Comparison matrix: BLOSUM62 
Number of alignments computed: 20 
Gap open penalty: 12 
Gap extension penalty: 4 

74.6% identity In 1096 residues overlap; Score: 4455.0; Gap frequency: 1.3% 

SINl, 1 MFSTLAFVAPSALGASTFVGAEVRSNVRIHSAFPAVHTATRKTNRLNVSMTALSDKQTAT 

SINS, 1 MFPTLTFIAPSALAASTFVGADIRSGIRIQSALPAVRNAVRRSKHYNVSMTALSDKQTAI 

** *+ * ***** ******* +* ** ** *** * * ************* 

SINl, 61 AGSTDNPDGIDYKTYDYVGVWGFSPLSNTNWFAAGSSTPGGITDWTATMNVNFDRIDNPS 

SINS, 61 SIGPDNPDGINYQNYDYIPVAGFTPLSNTNWYAAGSSTPGGITDWTATMNVKFDRIDNPS 

****** * *** * ** ******* ******************* ******** 

SINl, 121 ITVQHPVQVQVTSYNNNSYRVRFNPDGPIRDVTRGPILKQQLDWIRTQELSEGCDPGMTF 

SINS, 121 YSNNHPVQIQVTSYNNNSFRIRFNPDGPIRDVSRGPILKQQLTWIRNQELAQGCNPNMSF 

**** ********* * *********** ********* *** *** ** * * * 

SINl, 181 TSEGFLTFETKDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLYG 

SINS, 181 SPEGFLSFETKDLNVIIYGNCKMRVTKK-DGYLVMENDECNSQSDGNKCRGLMYVDRLYG 

**** ****** ****** * *** * ** ***** * ******** ****** 

SINl, 241 NAIASVNKNFRNDAVKQEGFYGAGEVNCKYQD TYILERTGIAMTNYNYDNLNYNQ 

SINS, 240 NAIASVQTNFHKDTSRNEKFYGAGEVNCRYEEQGKAPTYVLERSGLAMTNYNYDNLNYNQ 



SINl, 
SINS, 



296 WDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTSEQYSYGWFMDNVSQSYMNTGDTTW 
300 PDVVPPGYPD--HPNYYIPMYYAAPWLVVQGCAGTSKQYSYGWFMDNVSQSYMNTGDTAW 



SINl, 
SINS, 



356 NSGQEDLAYMGAQYGPFDQHFVYGAGGGMECVVTAFSLLQGKEFENQVLNKRSVMPPKYV 
358 NCGQENLAYMGAQYGPFDQHFVYGDGDGLEDVVKAFSFLQGKEFEDKKLNKRSVMPPKYV 



SINl, 
SINS, 



SINl, 
SINS, 



SINl, 
SINS, 



416 FGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVFTTK 
418 FGFFQGVFGALSLLKQNLPAGENNISVQEIVEGYQDNDYPFEGLAVDVDMQDDLRVFTTK 
*★***★*★* *** ********* ******* * ************* ******* 

47 6 GEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQDYEVNQTLRERQLY 
47 8 PEYWSANMVGEGGDPNNRS VFEWAHDRGLVCQTNVTCFLRNDNSGKPYEVNQTLREKQLY 
* * ** ** *************** ******* ******** * ********* *** 

536 TKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVAEWWGNNYKKLFSI 
538 TKNDSLNNTDFGTTSDGPGDAYIGHLDYGGGVECDAIFPDWGRPDVAQWWGENYKKLFSI 
****** **** * *** ***************** ********** *** ******** 



SINl, 
SINS, 



5 9 6 GLDFVWQDMTVPAMMPHKI GDDI NVKPDGN WPNADDPSNGQYNWKTYHPQVLVTDMR 

598 GLDFVWQDMTVPAMMPHRLGDAVNKNSGSSAPGWPNENDPSNGRYNWKSYHPQVLVTDMR 
***************** ** * *** ***** **** *********** 



SINl, 
SINS, 



SINl, 
SINS, 



65 3 Y-ENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISRGGYIGNQHFGGM 
65 8 YGAEYGREPMVSQRNIHAYTLCESTRREGIVGNADSLTKFRRSYIISRGGYIGNQHFGGM 
* ****** ************** **** *** ************************ 

712 WVGDNSTTSNYIQMMIANNINMNMSCLPLVGSDIGGFTSYDNENQRTPCTGDLMVRYVQA 
718 WVGDNSATES YLQMMLANI INMNMSCLPLVGSDIGGFTQYNDAGDPTP — EDLMVRFVQA 
****** * * *** ** ******************* * ** ***** *** 



SINl, 
SINS, 



772 GCLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQEVLYTAMYQNAA 
77 6 GCLLPWFRNHYDRWIESKKHGKKYQELYMYPGQKDTLKKFVEFRYRWQEVLYTAMYQNAT 
****************** *** ******** *** ********************* 



SINl, 
SINS, 



SINl, 
SINS, 



832 FGKPIIKAASMYNNDSNVRRAQNDHFLLGGHDGYRILCAPVVWENSTERELYLPVLTQWY 
83 6 TGEPIIKAAPMYNNDVNVYKSQNDHFLLGGHDGYRILCAPVVRENATSREVYLPVYSKWF 
* ****** ***** ** ********************* ** * ** **** * 

892 KFGPDFDTKPLEGAMNGGDRI YNYPVPQSES P I FVREGAI LPTRYTLNGENKSLNTYTDE 
89 6 KFGPDFDTKPLENEIQGGQTLYNYAAPLNDSPIFVREGTILPTRYTLDGVNKSINTYTDN 
************ ** *** * ******** ******** * *** ***** 



SINl, 
SINS, 



952 DPLVFEVFPLGNNRADGMCYLDDGGVTTNAEDNGKFSVVKVAAEQDGGTETITFTNDCYE 
95 6 DPLVFELFPLENNQAHGLFYHDDGGVTTNAEDFGKYSVISVKAAQEGSQMSVKFDNEVYE 
****** *** ** * * * *********** ** ** * * * * * * ** 



SINl, 
SINS, 



1012 YVFGGPFYVRVRGAQSPSNIHVSSGAGSQDMKVSSATSRAALFNDGENGDFWVDQETDSL 
1016 HQWGASFYVRVRNMGAPSNINVSSQIGQQDMQQSSVSSRAQMFTSANDGEYWVDQSTNSL 
* ****** **** *** * *** ** *** * * **** * ** 



SINl, 
SINS, 



1072 WLKLPNVVLPDAVITI 
107 6 WLKLPGAVIQDAAITV 
***** * ** ** 



5. Sequence Id No. 1 (SINl) aligned with Sequence Id No, 6 (SIN6) 



Sequence 1: SINl, (1088 residues) 
Sequence 2: SIN6, (570 residues) 



using the parameters: 
Comparison matrix: BLOSUM62 
Number of alignments computed: 2 0 
Gap open penalty: 12 
Gap extension penalty: 4 

77.9% identity in 578 residues overlap; Score: 2451.0; Gap frequency: 1.9% 



SINl, 
SIN6, 



28 3 MTNYNYDNLNYNQWDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTSEQYSYGWFMDN 
1 MTNYNYDNLNYNQPDLIPPGHDS — DPDYYI PMYFAAPWVIAHGYRGTSDQYSYGWFLDN 



k- *********** * ** ** ** 



t******* **** * * *** ******* ** 



SINl, 
SIN6, 



34 3 VSQSYMNTGDTTWNSGQEDLAYMGAQYGPFDQHFVYGAGGGMECVVTAFSLLQGKEFENQ 
S9 VSQSYTNTGDDAW-AGQKDLAYMGAQCGPFDQHFVYEAGDGLEDVVTAFSYLQGKEYENQ 



tr * * * * * 



t* ******** 



t******* ** * * ****** ***** *** 



SINl, 



40 3 VLNKRSVMPPKYVFGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVD 



118 GLNIRSAMPPKYVFGFFQGVFGATSLLRDNLPAGENNVSLEEIVEGYQNQNVPFEGLAVD 
** ****** * ********* * ********* 

4 63 VDMQDNLRVFTTKGEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQD 
178 VDMQDDLRVFTTRPAFWTANKVGEGGDPNNKSVFEWAHDRGLVCQTNVTCFLK — NEKNP 
***** ****** ******* ****** ******** ******* **** ** 

523 YEVNQTLRERQLYTKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVA 
236 YEVNQSLREKQLYTKSDSLDNIDFGTTPDGPSDAYIGHLDYGGGVECDALFPDWGRPDVA 
***** **★ ***** *** *** * ******************************** 

58 3 EWWGNNYKKLFSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGN WPNADDPSNGQYNW 

2 96 QWWGDNYKKLFSIGLDFVWQDMTVPAMMPHRLGDPVGTNSGETAPGWPNDKDPSNGRYNW 
*** ************************* ** *** ***** *** 

64 0 KTYHPQVLVTDMRYENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISR 
35 6 KSYHPQVLVTDMRYDDYGRDPIVTQRNLHAYTLCESTRREGIVGNADSLTKFRRSYIISR 
* ************ ** * ***** ********** **** *** ************ 

7 00 GGYIGNQHFGGMWVGDNSTTSNYIQMMIANNINMNMSCLPLVGSDIGGFTSYDNENQRTP 

416 GGYIGNQHFGGMWVGDNSSTEDYLAMMVINVINMNMSGVPLVGSDIGGFTEHDKRN P 

****************** * * ** * ****** *********** * * * 

7 60 CTGDLMVRYVQAGCLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQ 
473 CTPDLMMRFVQAGCLLPWFRNHYDRWIESKKHGKNYQELYMYRDHLDALRSFVELRYRWQ 
** *** * ********************* *** ******* * ** *** ***** 

820 EVLYTAMYQNAAFGKPIIKAASMYNNDSNVRRAQNDHF 
533 EVLYTT^YQNALNGKPIIKTVSMYNNDMNVKDAQNDHF 



Comparison of glucan lyase aminoacid sequences of Sequence ID No. 1 against No,2-6 
Part 1. Summary of the homolodiges of Squence ID No. 1 gainst No. 2-6 



Seq. ID. No. 


1 


2 


3 


4 


5 


6 


Homology 


100 


77.1 


26.8 


26.8 


74.6 


77.9 


(%) 















Part 2, Pairwise comparisons of Seq No. 1 against No. 2-6 

1. Sequence Id No, 1 (SINl) aligned with Sequence Id No. 2 (SIN2) 

Sequence 1: SINl, (1088 residues) 
Sequence 2: SIN2, (1091 residues) 

using the parameters : 
Comparison matrix: BLOSUM62 
Number of alignments computed: 2 0 
Gap open penalty: 12 
Gap extension penalty: 4 

77,1% identity in 1096 residues overlap; Score: 4534.0; Gap frequency: 1.2% 



SINl, 
SIN2, 



1 MFSTLAFVAPSALGASTFVGAEV-RSNVRIHSAFPAVHTATRKTNRLNVSMTALSDKQTA 
1 MYPTLTFVAPSALGARTFTCVGIFRSHILIHSVVPAVRLAVRKSNRLNVSMSALFDKPTA 



SINl, 
SIN2, 



60 TAGSTDNPDGIDYKTYDYVGVWGFSPLSNTNWFAAGSSTPGGITDWTATMNVNFDRIDNP 

61 VTGGKDNPDNINYTTYDYVPVWRFDPLSNTNWFAAGSSTPGDIDDWTATMNVNFDRIDNP 

★ ★ ★ * * * ■*■ ** * ***************** * 



SINl, 
SIN2, 



120 SITVQHPVQVQVTSYNNNSYRVRFNPDGPIRDVTRGPILKQQLDWIRTQELSEGCDPGMT 

121 SFTLEKPVQVQVTSYKNNCFRVRFNPDGPIRDVDRGPILQQQLNWIRKQEQSKGFDPKMG 
* * ***** *** *** ** ★ * ** * 



SINl, 
SIN2, 



180 FTSEGFLTFETKDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLY 

181 FTKEGFLKFETKDLNVIIYGNFKTRVTRKRDGKGIMENNEVPAGSLGNKCRGLMFVDRLY 
** **** ************** *** **** ** * ************** 



SINl, 
SIN2, 



24 0 GNAIASVNKNFRNDAVKQEGFYGAGEVNCKYQDT YILERTGITUyiTNYNYDNLNY 

241 GTAIASVNENYRNDPDRKEGFYGAGEVNCEFWDSEQNRNKYILERTGIAMTNYNYDNYNY 
* ****** * *** *********** * ***************** ** 



SINl, 
SIN2, 



294 NQWDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTS-EQYSYGWFMDNVSQSYMNTGD 
301 NQSDLIAPGYPS — DPNFYI PMYFAAPWVVVKGCSGNSDEQYSYGWFMDNVSQTYMNTGG 
* ***** **** * ** * * ************** ***** 



* * * * 



SINl, 
SIN2, 



353 TTWNSGQEDLAYMGAQYGPFDQHFVYGAGGGMECVVTAFSLLQGKEFENQVLNKRSVMPP 
35 9 TSWNCGEENLAYMGAQCGPFDQHFVYGDGDGLEDVVQAFSLLQGKEFENQVLNKRAVMPP 
* ** * * ******* ********** * * * ** ****************** **** 



SINl, 
SIN2, 



SINl, 
SIN2, 



413 KYVFGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVF 
419 KYVFGYFQGVFGIASLLREQRPEGGNNI SVQEI VEGYQSNNFPLEGLAVDVDMQQDLRVF 
***** ****** **** * * ***** ******* **** ********** * * * •* 

4 73 TTKGEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQDYEVNQTLRER 
47 9 TTKIEFWTANKVGTGGDSNNKSVFEWAHDKGLVCQTNVTCFLRNDNGGADYEVNQTLREK 
*** ****** ****** ** **************** ******** * ********** 



SINl, 
SIN2, 



533 QLYTKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVAEWWGNNYKKL 
539 GLYTKNDSLTNTNFGTTNDGPSDAYIGHLDYGGGGNCDALFPDWGRPGVAEWWGDNYSKL 
********* * ** * **************** *********** ****** ** ** 



SINl, 
SIN2, 



593 FSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGNWPNADDPSNGQYNWKTYHPQVLVTDMR 
599 FKIGLDFVWQDMTVPAMMPHKVGDAVDTRSPYGWPNENDPSNGRYNWKSYHPQVLVTDMR 



SINl, 
SIN2, 



653 
659 



YENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISRGGYIGNQHFGGMW 
YENHGREPMFTQRNMHAYTLCESTRKEGIVANADTLTKFRRSYIISRGGYIGNQHFGGMW 



SINl, 713 VGDNSTTSNYIQMMIANNINMNMSCLPLVGSDIGGFTSYDNENQRTPCTGDLMVRYVQAG 
SIN2, 719 VGDNSSSQRYLQMMIANIVNMNMSCLPLVGSDIGGFTSYDGRNV CPGDLMVRFVQAG 



SINl, 773 CLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQEVLYTAMYQNAAF 

SIN2 , 77 6 CLLPWFRNHYGRLVEGKQEGKYYQELYMYKDEMATLRKFIEFRYRWQEVLYTAMYQNAAF 

SINl, 833 GKPIIKAASMYNNDSNVRRAQNDHFLLGGHDGYRILCAPVVWENSTERELYLPVLTQWYK 

SIN2, 83 6 GKPIIKAASMYDNDRNVRGAQDDHFLLGGHDGYRILCAPVVWENTTSRDLYLPVLTKWYK 



SINl, 893 FGPDFDTKPLEGAMNGGDRIYNYPVPQSESPIFVREGAILPTRYTLNGENKSLNTYTDED 

SIN2, 89 6 FGPDYDTKRLDSALDGGQMIKNYSVPQSDSPIFVREGAILPTRYTLDGSNKSMNTYTDKD 

SINl, 953 PLVFEVFPLGNNRADGMCYLDDGGVTTNAEDNGKFSVVKVAAEQDGGTETITFTNDCYEY 

SIN2, 95 6 PLVFEVFPLGNNRADGMCYLDDGGITTDAEDHGKFSVINVEALRKGVTTTIKFAYDTYQY 

■*■***★**■**★*■*★***•*****■■*■*■* ** *** -kirir-k-k *• * * * ** * * ★ ■* 

SINl, 1013 VFGGPFYVRVRGAQSPSNIHVSSGAGSQDMKVSSATSRAALFNDGENGDFWVDQETDSLW 

SIN2, 1016 VFDGPFYVRIRNLTTASKINVSSGAGEEDMTPTSANSRAALFSDGGVGEYWADNDTSSLW 

★ * ★ +* +* ****** ** ★ * * * *** 



SINl, 1073 LKLPNVVLPDAVITIT 

SIN2, 107 6 MKLPNLVLQDAVITIT 



2. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 3 (SINS) 
Identity : 286 (26.83%) 

Similarity: 166 (15.57%) 

Number of gaps inserted in GL1FULL: 59 

Number of gaps inserted in MCSY1: 67 

GL1FULL (=S!N1). Total number of residues: 1088. 

MCSY1. (=SIN3). Total number of residues: 1066. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 2 
Unit gap cost : 2 

The character to show that two aligned residues are identical is *|' 

The character to show that two aligned residues are similor is 

Amino acids said to be 'similar' are: A.S.T; D,E; N.Q; R.K; l,UM.V; F.Y.W 



GL1FULL - M— FS-TLAFVAPSALGASTFVGAEVRSNVRIHSAFPAVHTATRKTNRLN -47 

Ml 1 1 • I . . .1 IN.. 

MCSYl - MAGFSDPLNFCKAEDYYS— VALDWKGPQKI— l-GVDHPPKSTKFP -43 

GL1 FULL - VSMTALSDKQTATAGSTDNPDGIOY-KTYDYVGVWG— FSP-LSNTNWFA -93 

I ... II . I . . 
MCSYl - KNWHGVNLRF-— DOGT— LGWQFIRPC VWRVRYDPGFKTSDEYG -84 

GL1FULL - AGSSTPGGITDWTATMNVNFDRIDN-PSITVQHPVQVQVTSYNNNS-YRV -141 

MCSYl - DENTRTI-VQDYMSTLS— NKLDTYRGLTWE TKCEDSGDF-F -122 

GL1FULL - RFNPDGPIRDVTRGPILKQQL-DWIRTQELSEGCDPGMTFSEGFLTFET -190 

I . I .... I .1 I I I 
MCSYl - TFS— SKVTAVEKSERTRNKVGDGLRtH-LWKS P-FRIQWRTLTP -164 



GL1FULL 



- -KDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLY -239 



II I I . -I II II Ml II 

MCSYl - LKDPYPI- -PNVAAAEARVSD-KV VWQTSP— K TF--R 195 

GL1FULL - GNAIASVNKNFRN— DAVK— QEGFYGAGEVNCKYQDTYILERTGIAMT -284 

. \ ■ I II . I II- I ■■ I I I 
MCSYl - -KNLHPQHKMLKDTVLDIVKPGHGEYVGWGEMGG-IQ— FMKEPT-F-MN -239 

GL1FULL - NYNYONLNYNQWDLRPPHHOGALNP-DYYIPMYYAAPWLI-VNGCAGTSE -332 

.|.|j..|.| . Ill . |.| . |. . II I 
MCSYl - YFNFDNMQYQQ— VYA— QGALDSRE— PLYHSDPFYLDVNS— NPE -278 

GL1FULL - QYSYGWFMDNVSQSYMNT-GDHWNSGQEDUY— MGAQYGPFDQHFVYG -379 

MCSY1 - H KNITATFIDNYSQIAIDFGKTNSGYIKLGTRYGGID-CYGIS -320 

GL1FULL - AGGGMECWTAFSLLQGKEFENQVLNKRSVMPPKYVFGFFQGVFGTSSLL -429 

I I .1 .. II. .1 |.|. I I .1 
MCSYl - ADTVPE-IVRLYTGLVGR— -SKL— K PRYILGAHQACYGYQQ 357 

GLIFULL - RAHMPAGENNI-SVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVFTTKGEF -478 

Ml I I II i IIINI I III 
MCSYl ESDLYSV— VQQYRDCKFPLDGIHVDVDVQDGFRTFHN 394 

GLIFULL - WTANRVGTGGDPNNRSVFE-WAHDKGLVCQTNITCFLRNON-EGQDY 523 

MCSYl PHTF— '— PNPKEMFTNL-RNNGIKCSTNITPVISINNREG-GYSTL -435 

GL1FULL - -E-VNQT— LRERQLYTKNDSLTGTD FG MTD-DGPSD -555 

II ..nil -I III 

MCSYl - LEGVDKKYFIMD-DRYTEGTSGNAKDVRYMYYGGGNKVEVDPNDVNGRPD -484 

GLIFULL AYIGHLD— YG— GGV ECDALFPDWGRPDVAEWWGNNY -589 

I III .11 I .1 III .1 

MCSYl - FKDNYDFPANFNSKQYPYHGGVSYGYGNGSAGFYPDLNRKEVRIWWGMQY -534 

GLIFULL - KKLFSIGLDFVWQDMIVPAMMPHK-IGDDINVKPDGNWPN— ADDPSNG -635 

I II .ll.lllllll . Ill II I 
MCSYl - KYLFDMGLEFVWQDMT— TPAIHTSYGD MKG-LPTRLLV— TSDS -574 

GLIFULL - QYNWKIYHPQVLVTDMRYENHGREPMVTQRNIHAYTLCESTRKEGI— VE -683 

MCSYl VTNA— S-EKKLAIETWALYSYNLHKATWH-GLSRLE -607 

GLIFULL - NADTLTKFRRSYIISRGGYIGNQHFGGMWVGDNSnSNYIQMMIANNINM -733 

1. 1 .Mill I mil 

MCSYl - S RKNKRNFILGRGSYAGAYRFAGLWTGDNASNWEFWKISVSQVLSL -653 

GLIFULL - NMSCLPLVGSDIGGFTSYDNEN— QRTPCTGDLMVRYVQAGCLLPWFRNH -781 

. . . Ill III I I |. .|..|. nil III 
MCSYl - GLNGVCIAGSDTGGFEPYRDANGVEEKYCSPELLIRWYTGSFLLPWLRNH -703 

GLIFULL - YDRWIESKDHGKDYQELYMYPN— E— MD TL RKFVEF -814 

I . II I .11 I III . I .11 

MCSYl - YVK KDR-KWFQEPYSYPKHLETHPELADQAWLYKSVLEICRYYVEL -748. 

GLIFULL - RYRWQEVLYTAMYQNAAFGKPIIKAASMYNN-DSNVRRAQNDHFL L -859 

II .11 I.|l I II ... . I. II . 

MCSYl - RYSLIQLLYDCMFQNWDGMPITRSMLLTDTEDTTFF-NESQKFLDNQYM -797 



GLIFULL - GGHDGYRILCAPWWENS-TE RELYLPVLTQWY— KFGP-DFDT- -899 

I I II II.- I I I..|l|. II III 
MCSY1 - AGOD— ILVAPIL— HSRKEIPGENRDVYLPLYHTWYPSNLRPWD-OQG -841 

GLIFULL KPLEGAM— NGGDRIY— -NYPVPQSESPIFVREGAILPTRYTLNG -940 

Ml I II I . I . 
MCSY1 - VALGNPVEGGSVINYTARIVAPEDYNLFHSWPVYVREGAIIPQ— I 886 

GLIFULL - ENKSLNTYTDEDPLVFEVFPLGNNRADGMCYLDDGGVnNA— EDNGKFS -988 

I . . I ..II . mil I. . II 

MCSY1 - EVRQWTGQGGANRIKFNIYP-GKOKEYCT-YLDDG-VSRDSAPED— LP -930 
GLIFULL - WKVAAEQD— GGTETIT— F-TNDCYEYVFG-GP FYVRVRGAQSP -1028 

Ml 1-1 |. Ml . .1 .!• 

MCSY1 - QYKETHEQSKVEGAE-IAKQIGKKTGYN-ISGTDPEAKGYHRKVAVTQTS -978 

GLIFULL - SNIHVSSGAGSQDMKVSS-ATSRAA L FNDG 1057 

....j |. I I 

MCSY1 - KD— KTRTVTIEPKHNGYDPSKEVGDYnilLWYAPGFDGSIVDVSKTT -1025 

GLIFULL - ENGDFWVDQE TD— SLWLKLPNWL— PDAV-ITIT -1088 

I . |. .1 .. . . |. .1 III 
MCSY1 - VNVEGGVEHQVYKNSDLHTWIDV-KEVIGTTKSVKITCTAA -1066 



3. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 4 (SIN4) 

Identity : 287 (26.82%) 

Similarity: 160 (14.95%) 

Number of gops inserted in GLIFULL: 64 

Number of gops inserted in MVSY1: 63 

* ALIGNMENT OF TWO PROTEIN SEQUENCES. * 

The two sequences to be aligned ore: 

GL1FULL (=SIN1). Total number of residues: 1088. 
MVSY1 (=SIN4). Total number of residues: 1070. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 2 
Unit gap cost : 2 

Ttie cfiarocter to stiow that two aligned residues are identical is '|' 

The character to show that two aligned residues are similar is '.' 

Amino acids said to be 'similar' are: A,S,T; D,E; N,Q; R,K; l,L,M,V; F,Y,W 



GLIFULL - MFSTLAFVAPSALGAS-TFVGAEVRSNVRIHSAFPAVH— TATRKTNRLN -47 

I .1 I II . I . I I 
MVSY1 - M AGLSDPLNFCKAE— -DY YAAAKGWSGPQKIIRYD -33 

GLIFULL - VSMTALSDKQTATAGSTDN-P— DGIDYKTYDYVG— VWG— FSP-LSNT -89 

. I ■ ■ I III .III . I . . 
MVSYl - QTP-PQGTKD-PKSWHAVNLPFDDGTMCW-QFVRPCVWRVRYDPSVKTS -80 

GLIFULL - NWFAAGSSTPGGITDWTATMNVNFORIDNPSITVQHPVQVQVTSYNNNS- -138 

. . . I. .|.| I .1 |.. 
MVSYl - DEYGDENTRTI-VQDYMTTLVGNLDIF— RGLTW VSTLEDSGE -120 



GLIFULL - YRVRFNPDGPIRD\ATRGPILKQQLDWIRTQELSEGCDPGMTFTSEGFL 186 

I I . I I .11 II. .1 

MVSY1 - Y-YTFKSEVTAVOET E— RTR— NKVGO-GLKI YLWK -151 

GLIFULL - -TFETKDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFV -235 
MVSY1 - NPFRIQWRLL TPLV— DPFPIP-N— VANATA R— VA -182 

GLIFULL - DRLYGNAIA-SVNKNFR— NDAVKQEGF— YGAGEVNCKYQDTYIL— ER -278 

I II .1 I M 

MVSY1 - OKWWQTSPKTFRKNLHPQHKMLKDWLDIIKPG HGEYVGWGEM -226 

GLIFULL - TGIA MT--NY-NYDNLNYNQWDLRPPHHDGALNP-DYYIPMYYAAP -320 

11 I II I.||..|.| ■ III . |.|.| 
MVSY1 - GGIEFMKEPTFMNYFNFDNMQYQQ--VYA— QGALDSRE— PLYHSDP -268 

GL1FULL - WLI-VNGCAGTSEQYSYGWFMDNVSQSYMNT-GDTTWNSGQEDLAY— MG -366 

MVSY1 - FYLDVNS— -NPEH KNITATFIONYSQIAIDFGKTNSGYIKLG -308 

GLIFULL - AOYGPFDQHFVYGAGGGMECWTAFSLLQGKEFENQVLNKRSVMPPKYVF -416 

. II I . I |.| .. II. .1 |.|. 
MVSYl - TRYGGID-CYGISADWPE-IVRLYTGLVGR— SKL— K PRYIL -346 

GLIFULL - GFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLTWDVDMQ -466 

II .1 I . .11 II .11 nil I 

MVSYl - GAHQACYGYQQ — ESDLHA— WQQYRDTKFPLDGLHVDVDFQ -385 

GLIFULL - DNLRVFnKG-EFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFL- -514 

mill I I . . Mi . 

MVSYl - DNFRTEHNPITF— PNPK EMFT-NL RNNGIKCSTNITPVIS -424 

GLIFULL - -RN— D NEGQD— Y EVN QTLR ERQL-Y -535 

I III II I I -I 

MVSYl - IRDRPNGYSTLNEGYDKKYFIMDDRYTEGTSGDPQNVRYSFYGGGNPVEV -474 

GLIFULL - TKNOSLTGTDFG MTD DGPSDAYIGHLOYGGGVECDALFPDW -576 

MVSYl - NPNDVWARPDFGDNYDFPTNFNCKDYPYHGGVSY-GYGNGTP— GYYPOL -521 

GLIFULL - GRPDVAEWWGNNYKKLFSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGNW -626 

I .1 III .1 II .ll.lllllll II I 
MVSYl - NREEVRIWWGLQYEYLFNMGLEFVWQOMTTPA IH SSY -558 

GLIFULL - PNADDPSNGQYNWKTYHPQVLVTDMRYENHG-REPMVTQRNIHAYTLCES -675 

MVSYl GDM-KG LPTRLLVTADSVTNASEKKLAIESWALYSYNLHKA -598 

GLIFULL - TRKE-GIVENADTLTKFRRSYIISRGGYIGNQHFGGMWVGDNSnSNYIQ -724 

I I .1 I .1 .|. II I I I l-l lll-.l - 
MVSYl - TFHGLGRLES RKNKRNFILGRGSYAGAYRFAGLWTGDNASTWEFWK -644 

GLIFULL - MMIANNINMNMSCLPLVGSDIGGFTSYDNE-NQRTPCTGDLMVRYVQAGC -773 

Ill III I . |. .|..|. 

MVSYl - ISVSQVLSLGLNGVCIAGSDTGGFEPARTEIGEEKYCSPELLIRWYTGSF -694 

GLIFULL - LLPWFRNHYDRWIESKDHGKOYQELYMYPNEMDT— L R 809 

nil nil . II I .n i w ..i i . 



MVSY1 - LLPWLRNHYVK KDR-KWFQEPYAYPKHLETHPELADQAWLYKSVLE -739 

GLIFULL KF-VEFRYRWQEVLYTAMYQNAA FGK-PIIKAA--SMYNNDSNV -849 

•• II II .11 l-ll I.|. 

MVSY1 - ICRYWVELRYSLIQLLYDCMFQNWDGMPLARSMLLTDTEOTTFFNESQ- -788 

GLIFULL - RRAQNDHFLLGGHOGYRILCAPWWENS-TE RELYLPVLTQWY 891 

. I I II II.. I I |..|l|. II 

MVSY1 - -KFLDNQYM-AGDD— ILVAPIL— HSRNEVPGENRDVYLPLFHTWYPS -831 

GLIFULL - KFGP-DFDT KPLEGAM— NGGDRIY— NYPVPQSESPIFVREGA -930 

III |.|l I II I- Nlll 
MVSY1 - NLRPWD-DQGVALGNPVEGGSVINYTARIVAPEDYNLFHNWPVYIREGA -880 

GLIFULL - ILPT— RYTLNGENKSLNITTDEDPLVFEVFPLGNNRADGMCYLODGGV -977 

i.i I II i.i.-ii . mil I 

MVSY1 - IIPQIQVR-QWIGE— G GPNPIKFNIYP-GKDKEYVT-YLDDG-V -919 

GLIFULL - HNAEDNGKFSWKVAAEQD— GGTET— ITF TNDCYEYVFGGP -1017 

. I II I . .. Ill I 
MVSY1 - SRDSAPDD-LPQYREAYEQAKVEGKDVQKQLAVIQGNKTND FSAS -963 

GLIFULL - FWR-VRGAQSPSNI-HVS— SGAGSQDMKVSS-ATSRAA LF -1054 

. .1 I I .... I I. I. 
MVSY1 - GIOKEAKGYHRKVSIKQESKDKTRmiEPKHNGYOPSKEVGNnrTIILW -1013 

GLIFULL - NDGE-NGDFWVD— QETDSL— WLK — LPNWLPDA— Vl-T 1086 

I II I I . . I II. II I 

MVSY1 - YAPGFDGSI-VDVSQATVNIEGGVECEIFKNTGLHTWV-NVKEVIGnK -1061 

GLIFULL IT -1088 

MVSY1 - SVKITCnA -1070 



4. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 5 (SINS) 

Sequence 1: SINl, (1088 residues) 
Sequence 2: SINS, (1092 residues) 

using the parameters : 
Comparison matrix: BL0SUM62 
Number of alignments computed: 20 
Gap open penalty: 12 
Gap extension penalty: 4 

74.6% identity in 1096 residues overlap; Score: 4455.0; Gap frequency: 1.3% 

SINl, 1 MFSTLAFVAPSALGASTFVGAEVRSNVRIHSAFPAVHTATRKTNRLNVSMTALSDKQTAT 

SINS, 1 MFPTLTFIAPSALAASTFVGADIRSGIRIQSALPAVRNAVRRSKHYNVSMTALSDKQTAI 

** * ***** ******* ** ** ** *** * * ************* 

SINl, 61 AGSTDNPDGIDYKTYDYVGVWGFSPLSNTNWFAAGSSTPGGITDWTATMNVNFDRIDNPS 

SINS, 61 SIGPDNPDGINYQNYDYIPVAGFTPLSNTNWYAAGSSTPGGITDWTATMNVKFDRIDNPS 

****** * *** * ** ******* ******************* ******** 

SINl, 121 ITVQHPVQVQVTSYNNNSYRVRFNPDGPIRDVTRGPILKQQLDWIRTQELSEGCDPGMTF 

SINS, 121 YSNNHPVQIQVTSYNNNSFRIRFNPDGPIRDVSRGPILKQQLTWIRNQELAQGCNPNMSF 

**** ********* * *********** ********* *** *** ** * * * 

SINl, 181 TSEGFLTFETKDLSVIIYGNFKTRVTRKSDGKVIMENDEVGTASSGNKCRGLMFVDRLYG 

SINS, 181 SPEGFLSFETKDLNVIIYGNCKMRVTKK-DGYLVMENDECNSQSDGNKCRGLMYVDRLYG 

**** ****** ****** * *** * ** ***** * ******** ****** 



SINl, 
SINS, 



241 NAIASVNKNFRNDAVKQEGFYGAGEVNCKYQD TYILERTGIAMTNYNYDNLNYNQ 

240 NAIASVQTNFHKDTSRNEKFYGAGEVNCRYEEQGKAPTYVLERSGLAMTNYNYDNLNYNQ 



SINl, 
SINS, 



296 
300 



WDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTSEQYSYGWFMDNVSQSYMNTGDTTW 
PDVVPPGYPD--HPNYYIPMYYAAPWLVVQGCAGTSKQYSYGWFMDNVSQSYMNTGDTAW 
* ★ **** ******* * * ****** ********************* * 



SINl, 
SINS, 



356 NSGQEDLAYMGAQYGPFDQHFVYGAGGGMECVVTAFSLLQGKEFENQVLNKRSVMPPKYV 
358 NCGQENLAYMGAQYGPFDQHFVYGDGDGLEDVVKAFSFLQGKEFEDKKLNKRSVMPPKYV 
* *** ****************** * * * ** *** ******* ************ 



SINl, 
SINS, 



416 FGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVDVDMQDNLRVFTTK 
418 FGFFQGVFGALSLLKQNLPAGENNISVQEIVEGYQDNDYPFEGLAVDVDMQDDLRVFTTK 
********* *** ********* ******* * ************* ******* 



SINl, 
SINS, 



47 6 GEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQDYEVNQTLRERQLY 
478 PEYWSANMVGEGGDPNNRSVFEWAHDRGLVCQTNVTCFLRNDNSGKPYEVNQTLREKQLY 
* * ** ** *************** ******* ******** * ********* *** 



SINl, 
SINS, 



53 6 TKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVAEWWGNNYKKLFSI 
538 TKNDSLNNTDFGTTSDGPGDAYIGHLDYGGGVECDAIFPDWGRPDVAQWWGENYKKLFSI 
****** **** * *** ***************** ********** *** ******** 



SINl, 
SINS, 



596 GLDFVWQDMTVPAMMPHKIGDDINVKPDGN WPNADDPSNGQYNWKT YHPQVLVTDMR 

598 GLDFVWQDMTVPAMMPHRLGDAVNKNSGSSAPGWPNENDPSNGRYNWKS YHPQVLVTDMR 
***************** ** * *** ***** **** *********** 



SINl, 
SINS, 



653 Y-ENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISRGGYIGNQHFGGM 
658 YGAEYGREPMVSQRNIHAYTLCESTRREGIVGNADSLTKFRRSYIISRGGYIGNQHFGGM 
* ****** ************** **** *** ************************ 



SINl, 
SINS, 



712 WVGDNSTTSNYIQMMIANNINMNMSCLPLVGSDIGGFTSYDNENQRTPCTGDLMVRYVQA 
718 WVGDNSATESYLQMMLANIINMNMSCLPLVGSDIGGFTQYNDAGDPTP — EDLMVRFVQA 
****** * * *** ** ******************* * ** ***** *** 



SINl, 
SINS, 



772 GCLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQEVLYTAMYQNAA 
77 6 GCLLPWFRNHYDRWIESKKHGKKYQELYMYPGQKDTLKKFVEFRYRWQEVLYTAMYQNAT 
****************** *** ******** *** ********************* 



SINl, 
SINS, 



832 FGKPIIKAASMYNNDSNVRRAQNDHFLLGGHDGYRILCAPVVWENSTERELYLPVLTQWY 
836 TGEPIIKAAPMYNNDVNVYKSQNDHFLLGGHDGYRILCAPVVRENATSREVYLPVYSKWF 
* ****** ***** ** ********************* ** * ** **** * 



SINl, 
SINS, 



892 KFGPDFDTKPLEGAMNGGDRIYNYPVPQSESPIFVREGAILPTRYTLNGENKSLNTYTDE 
896 KFGPDFDTKPLENEIQGGQTLYNYAAPLNDSPIFVREGTILPTRYTLDGVNKSINTYTDN 
************ ** *** * ******** ******** * *** ***** 



SINl, 
SINS, 



952 DPLVFEVFPLGNNRADGMCYLDDGGVTTNAEDNGKFSVVKVAAEQDGGTETITFTNDCYE 
956 DPLVFELFPLENNQAHGLFYHDDGGVTTNAEDFGKYSVISVKAAQEGSQMSVKFDNEVYE 
****** *** ** * * * *********** ** ** * * * * * * ** 



SINl, 
SINS, 



1012 YVFGGPFYVRVRGAQSPSNIHVSSGAGSQDMKVSSATSRAALFNDGENGDFWVDQETDSL 
1016 HQWGASFYVRVRNMGAPSNINVSSQIGQQDMQQSSVSSRAQMFTSANDGEYWVDQSTNSL 
* ****** **** +** * *** ** *** * * **** * ** 



SINl, 
SINS, 



1072 WLKLPNVVLPDAVITI 
107 6 WLKLPGAVIQDAAITV 



tr**** * ** ** 



5. Sequence Id No. 1 (SINl) aligned with Sequence Id No. 6 (SIN6) 

Sequence 1: SINl, (1088 residues) 
Sequence 2: SIN6, (570 residues) 

using the parameters: 
Comparison matrix: BL0SUM62 
Number of alignments computed: 20 
Gap open penalty: 12 
Gap extension penalty: 4 

77.9% identity in 578 residues overlap; Score: 2451.0; Gap frequency: 1.9% 

SINl, 283 MTNYNYDNLNYNQWDLRPPHHDGALNPDYYIPMYYAAPWLIVNGCAGTSEQYSYGWFMDN 

SIN 6, 1 MTNYNYDNLNYNQPDLIPPGHDS — DPDYYI PMYFAAPWVI AHGYRGTSDQYSYGWFLDN 

************* ** +* ** ******** **** * * *** ******* ** 

SINl, 343 VSQS YMNTGDTTWNSGQEDLAYMGAQYGP FDQHFVYGAGGGMEC VVTAFS LLQGKEFENQ 

SIN6, 59 VSQSYTNTGDDAW-AGQKDLAYMGAQCGPFDQHFVYEAGDGLEDVVTAFSYLQGKEYENQ 

***** **** * ** ******** ********* ** * * ****** ***** *** 



SINl, 



4 03 VLNKRSVMPPKYVFGFFQGVFGTSSLLRAHMPAGENNISVEEIVEGYQNNNFPFEGLAVD 



SIN6, 118 GLNIRSAMPPKYVFGFFQGVFGATSLLRDNLPAGENNVSLEEIVEGYQNQNVPFEGLAVD 

★ * *★ ★**★*★ + ★******* + * * + 

SINl , 4 63 VDMQDNLRVFTTKGEFWTANRVGTGGDPNNRSVFEWAHDKGLVCQTNITCFLRNDNEGQD 

SIN 6, 178 VDMQDDLRVFTTRPAFWTANKVGEGGDPNNKSVFEWAHDRGLVCQTNVTCFLK — NEKNP 

+ ****** ***** ** ****** ******** ******* **** ** 

SINl , 523 YEVNQTLRERQLYTKNDSLTGTDFGMTDDGPSDAYIGHLDYGGGVECDALFPDWGRPDVA 

SIN6, 236 YEVNQSLREKQLYTKSDSLDNIDFGTTPDGPSDAYIGHLDYGGGVECDALFPDWGRPDVA 

***** *** ***** *** *** * ******************************** 

SINl, 583 EWWGNNYKKLFSIGLDFVWQDMTVPAMMPHKIGDDINVKPDGN WPNADDPSNGQYNW 

SIN6, 296 QWWGDNYKKLFSIGLDFVWQDMTVPAMMPHRLGDPVGTNSGETAPGWPNDKDPSNGRYNW 

*** ************************* ** *** ***** *** 

SINl, 640 KTYHPQVLVTDMRYENHGREPMVTQRNIHAYTLCESTRKEGIVENADTLTKFRRSYIISR 

SIN6, 356 KSYHPQVLVTDMRYDDYGRDPIVTQRNLHAYTLCESTRREGIVGNADSLTKFRRSYIISR 

* ************ ** * ***** ********** **** *** ************ 

SINl, 700 GGYIGNQHFGGMWVGDNSTTSNYIQMMIANNINMNMSCLPLVGSDIGGFTSYDNENQRTP 

SIN6, 416 GGYIGNQHFGGMWVGDNSSTEDYLAMMVINVINMNMSGVPLVGSDIGGFTEHDKRN P 

****************** * * ** * ****** *********** * * * 

SINl, 7 60 CTGDLMVRYVQAGCLLPWFRNHYDRWIESKDHGKDYQELYMYPNEMDTLRKFVEFRYRWQ 

S IN 6 , 473 CTPDLMMRFVQAGCLLPWFRNHYDRWI ESKKHGKNYQEL YMYRDHLDALRS FVELRYRWQ 

** *** * ********************* *** ******* * ** *** ***** 



SINl, 820 EVLYTAMYQNAAFGKPIIKAASMYNNDSNVRRAQNDHF 

SIN 6, 533 EVLYTANYQNALNGKPIIKTVSMYNNDMNVKDAQNDHF 

*********** ****** ****** ** ****** 



===03-MAY-2001== 



==NALIGN=- 



=PC/GENE 



****************************************** 

* ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 
****************************************** 

The two sequences to be aligned are: 
SEQIDN07. 

Total number of bases: 3267. 
SEQIDN08. 

Total number of bases: 3276. 

Open gap cost : 50 

Unit gap cost : 20 



The character to show that two aligned residues are identical is ' 
SEQIDN07 
SEQIDN08 



ATGTTTTCAACCCTTGCGTTTGTCGCACCTAGTGCGCTGGGAGCCAGTAC -50 

MM I MMMI I M M M MMMMMI M M I M M 

ATGTATCCAACCCTCACCTTCGTGGCGCCTAGTGCGCTAGGGGCCAGAAC -50 



SEQIDN07 - CTTCGTAGGGGCGGAGGTC AGGTCAAATGTTCGTATCCATTCCGCTT -97 

Ml MM I MMM I Ml Ml MMI I I 

SEQIDN08 - TTTCACGTGTGTGGGCATTTTTAGGTCACACATTCTTATTCATTCGGTTG -100 
SEQIDN07 - TTCCAGCTGTGCACACAGCTACTCGCAAAACCAATCGCCTCAATGTATCC -147 

MMMMMI MM I II I II II II II II II I M I II II I 

SEQIDN08 - TTCCAGCGGTGCGTCTAGCTGTGCGCAAAAGCAACCGCCTCAATGTATCC -150 
SEQIDN07 - ATGACCGCATTGTCCGACAAACAAACGGCTACTGCGGGTAGTACAGACAA -197 

III MM MM llllllll II III II M I I Mill 

SEQIDN08 - ATGTCCGCTTTGTTCGACAAACCGACTGCTGTTACTGGAGGGAAGGACAA -200 
SEQIDN07 - TCCGGACGGTATCGACTACAAGACCTACGATTACGTCGGAGTATGGGGTT -247 

MMM MM MMI M II II IMIII II III I I 

SEQIDN08 - CCCGGACAATATCAATTACACCACTTATGACTACGTCCCTGTGTGGCGCT -250 
SEQIDN07 - TCAGCCCCCTCTCCAACACGAACTGGTTTGCTGCCGGCTCTTCTACCCCG -297 

II IMIIII III llllllimillllMIII Mill M II 

SEQIDN08 - TCGACCCCCTCAGCAATACGAACTGGTTTGCTGCCGGATCTTCCACTCCC -300 



SEQIDN07 - GGTGGCATCACTGATTGGACGGCTACAATGAATGTCAACTTCGACCGTAT -347 

II I II II llllllll IIIIIIIIIM IIIIMIIIIIIII 

SEQIDN08 - GGCGATATTGACGACTGGACGGCGACAATGAATGTGAACTTCGACCGTAT -350 

SEQIDN07 - CGACAATCCGTCCATCACTGTCCAGCATCCCGTTCAGGTTCAGGTCACGT -397 

II III I III III llll'l II II I II II II III I II II II II III 
SEQIDN08 - CGACAATCCATCCTTCACTCTCGAGAAACCGGTTCAGGTTCAGGTCACGT -400 

SEQIDN07 - CATACAACAACAACAGCTACAGGGTTCGCTTCAACCCTGATGGCCCTATT -447 

MMI I Mill I I IIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

SEQIDN08 - CATACAAGAACAATTGTTTCAGGGTTCGCTTCAACCCTGATGGTCCTATT -450 



SEQIDN07 - CGTGATGTGACTCGTGGGCCTATCCTCAAGCAGCAACTAGATTGGATTCG -497 

II llllll IIIIMMIillllll lllllllllll IIIIIM II 

SEQIDN08 - CGCGATGTGGATCGTGGGCCTATCCTCCAGCAGCAACTAAATTGGATCCG -500 
SEQIDN07 - AACGCAGGAGCTGTCAGAGGGATGTGATCCCGGAATGACTTTCACATCAG -547 

I IIIIMII III INI I llllll III llllll II 

SEQIDN08 - GAAGCAGGAGCAGTCGAAGGGGTTTGATCCTAAGATGGGCTTCACAAAAG -550 
SEQIDN07 - AAGGTTTCTTGACTTTTGAGACCAAGGATCTAAGCGTCATCATCTACGGA -597 

llllllllllll lllllllllllllllll I III Mill II II 

SEQIDN08 - AAGGTTTCTTGAAATTTGAGACCAAGGATCTGAACGTTATCATATATGGC -600 
SEQIDN07 - AATTTCAAGACCAGAGTTACGAGAAAGTCTGACGGCAAGGTCATCATGGA -647 

Mill mil lllllllllll Ml II II II I IIIIMII 

SEQIDN08 - AATTTTAAGACTAGAGTTACGAGGAAGAGGGATGGAAAAGGGATCATGGA -650 
SEQIDN07 - AAATGATGAAGTTGGAACTGCATCGTCCGGGAACAAGTGCCGGGGATTGA -697 

III IIMIII I I IMII IIIIMIIIIIIIIIIIIIill 

SEQIDN08 - GAATAATGAAGTGCCGGCAGGATCGTTAGGGAACAAGTGCCGGGGATTGA -700 
SEQIDN07 - TGTTCGTTGATAGATTATACGGTAACGCTATCGCTTCCGTCAACAAGAAC -747 

!MI II II II II Mill I II lllllllllll II I II 

SEQIDN08 - TGTTTGTCGACAGGTTGTACGGCACTGCCATCGCTTCCGTTAATGAAAAT -750 
SEQIDN07 - TTCCGCAACGACGCGGTCAAGCAGGAGGGATTCTATGGTGCAGGTGAAGT -797 

I IIIIIMII llllll Mill IIIIIIIIIMIM Mill 

SEQIDN08 - TACCGCAACGATCCCGACAGGAAAGAGGGGTTCTATGGTGCAGGAGAAGT -800 
SEQIDN07 - CAACTGTAAGTACCAGGACACC TACATCTTAG -829 

Mill Ml MM II IMIIMIII 

SEQIDN08 - AAACTGCGAGTTTTGGGACTCCGAACAAAACAGGAACAAGTACATCTTAG -850 
SEQIDN07 - AACGCACTGGAATCGCCATGACAAATTACAACTACGATAACTTGAACTAT -879 

MM IIIIIIIIIIIIIIIIMIMIIIII II II MM Mill 

SEQIDN08 - AACGAACTGGAATCGCCATGACAAATTACAATTATGACAACTATAACTAC -900 
SEQrDN07 - AACCAGTGGGACCTTAGACCTCCGCATCATGATGGTGCCCTCAACCCAGA -929 

IIMMI II MM MM M I I MM I 

SEQIDN08 - AACCAGTCAGATCTTATTGCTCCAGGATATCCT TCCGACCCGAA -944 

SEQrDN07 - CTATTATATTCCAATGTACTACGCAGCACCTTGGTTGATCGTTAATGGAT -979 

II II Mill MM! I llllllllllll I I Mill MM 

SEQIDN08 - CTTCTACATTCCCATGTATTTTGCAGCACCTTGGGTAGTTGTTAAGGGAT -994 
SEQIDN07 - GCGCCGGTACTTCGGAG CAGTACTCGTATGGATGGTTCATGGACAAT -1026 

M II I II lllllllllll IIIIIMI Mill III 

SEQIDN08 - GCAGTGGCAACAGCGATGAACAGTACTCGTACGGATGGTTTATGGATAAT -1044 
SEQrDN07 - GTCTCTCAATCTTACATGAATACTGGAGATACTACCTGGAATTCTGGACA -1076 

MM! Ill IIMMIMIIIIIII I MM IIIIIM I MM I 

SEQIDN08 - GTCTCCCAAACTTACATGAATACTGGTGGTACTTCCTGGAACTGTGGAGA -1094 
SEQIDN07 - AGAGGACCTGGCATACATGGGCGCGCAGTATGGACCATTTGACCAACATT -1126 



SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 
SEQIDN07 
SEQIDN08 



III II IIIIIIMIIIII II MM M IIIIIIIIMMIMI 

GGAGAACTTGGCATACATGGGAGCACAGTGCGGTCCATTTGACCAACATT 



-1144 



TTGTTTACGGTGCTGGGGGTGGGATGGAATGTGTGGTCACAGCGTTCTCT -1 176 

MM II MM III I III I M III III IIIMIIMI 

TTGTGTATGGTGATGGAGATGGTCTTGAGGATGTTGTCCAAGCGTTCTCT -1 194 
CTTCTACAAGGCAAGGAGTTCGAGAACCAAGTTCTCAACAAACGTTCAGT -1226 

Mill llllllll IIIM IIMIMIMIIII lllllllll I II 

CTTCTGCAAGGCAAAGAGTTTGAGAACCAAGTTCTGAACAAACGTGCCGT -1244 
AATGCCTCCGAAATACGTCTTTGGTTTCTTCCAGGGTGTTTTCGGGACTT -1 276 

MMMMMMMI II IIIIMI Ml Mill II II MM I 

AATGCCTCCGAAATATGTGTTTGGTTACTTTCAGGGAGTCTTTGGGATTG -1 294 
CTTCCTTGTTGAGAGCGCATATGCCAGCAGGTGAGAACAACATCTCAGTC -1326 

MMMMMMMI III I MM MM II IIIIMII II 

CTTCCTTGTTGAGAGAGCAAAGACCAGAGGGTGGTAATAACATCTCTGTT -1 344 
GAAGAAATTGTAGAAGGTTATCAAAACAACAATTTCCCTTTCGAGGGGCT -1376 

MM IIIM llllllll MM IM II I MM II I IIIIMI 

CAAGAGATTGTCGAAGGTTACCAAAGCAATAACTTCCCTTTAGAGGGGTT -1 394 
CGCTGTGGACGTGGATATGC AAGACAACTTGCGGGTGTTCACCACGAAGG - 1 426 

II II II IIMIIIIIIII I I IIMI MMMMMMMI 

AGCCGTAGATGTGGATATGCAACAAGATTTGCGCGTGTTCACCACGAAGA - 1 444 
GCGAATTTTGGACCGCAAAC AGGGTGGGTACTGGCGGGGATCCAAACAAC - 1 476 

MMIIMIM Mill MM II II II II M I II III 

TTGAATTTTGGACGGCAAATAAGGTAGGCACCGGGGGAGACTCG AAT AAC - 1 494 
CGATCGGTTTTTGAATGGGCACATGACAAAGGCCTTGTTTGTCAGACAAA -1526 

Mill IIIIMII I IIMIIIIIIII III Mill IIIIMII II 

AAGTCGGTGTTTGAATGGGCACATGACAAAGGCCTTGTATGTCAGACGAA -1 544 
TATAACTTGCTTCCTGAGGAATGATAACGAGGGGCAAGACTACGAGGTCA -1 576 

I I MIIIIMI MM II II MM III III Mill MM 

TGTTACTTGCTTCTTGAGAAACGACAACGGCGGGGCAGATTACGAAGTCA -1 594 
ATCAGACGTTAAGGGAGAGGCAGTTGTACACGAAGAACGACTCCCTGACG -1626 

lllllll M IIMIII I llllllllllllll Mill IIIIM 

ATCAGACATTGAGGGAGAAGGGTTTGTACACGAAGAATGACTCACTGACG -1 644 
GGTACGGATTTTGGAATGACCGACGACGGCCCCAGCGATGCGTACATCGG -1676 

II I II MM III lllllll II IIIIMIIIIIIII II 

AACACTAACTTCGGAACTACCAACGACGGGCCGAGCGATGCGTACATTGG - 1 694 
TCATCTGGACTATGGGGGTGGAGTAGAATGTGATGCACTTTTCCCAGACT -1 726 

MMMIMIMM II MM I I II II I II I M 1 1 II 1 1 M II I 

ACATCTGGACTATGGTGGCGGAGGGAATTGTGATGCACTTTTCCCAGACT -1 744 
GGGGACGGCCTGACGTGGCCGAATGGTGGGGAAATAACTATAAGAAACTG -1 776 

MM II II I IMM MMIIMIM lllllll I II II 

GGGGTCGACCGGGTGTGGCTGAATGGTGGGGTGATAACTACAGCAAGCTC -1 794 



SEQIDN07 - TTCAGCATTGGTCTCGACTTCGTCTGGCAAGACATGACTGTTCCAGCAAT -1826 

nil IMIIIII II IMIIIIIMIIIIIIMM IIMIIII II 

SEQIDN08 - TTCAAAATTGGTCTGGATTTCGTCTGGCAAGACATGACAGTTCCAGCTAT -1844 
SEQIDNO? - GATGCCGCACAAAATTGGCGATGACATCAATGTGAAACCGGATGGGAATT -1876 

mill mill iiiiiii I II II II I I I I 

SEQrDN08 - GATGCCACACAAAGTTGGCGACGCAGTCGATACGAGATCACCTTACGGCT -1894 
SEQIDN07 - GGCCGAATGCGGACGATCCGTCCAATGGACAATACAACTGGAAGACGTAC -1926 

iiiiiiiii I I mil II II MM mm iiiii i iii 

SEQIDN08 - GGCCGAATGAGAATGATCCTTCGAACGGACGATACAATTGGAAATCTTAC -1944 
SEQIDN07 - CATCCCCAAGTGCTTGTAACTGATATGCGTTATGAGAATCATGGTCGGGA -1976 

IIIII IIIII II llllllllllllll III IIIMIIIIII MM 

SEQIDN08 - CATCCACAAGTTCTCGTAACTGATATGCGATATGAGAATCATGGAAGGGA -1994 
SEQIDN07 - ACCGATGGTCACTCAACGCAACATTCATGCGTATACACTGTGCGAGTCTA -2026 

IIIIIII iiiiiiiiiiiii II IMIIIII IIIII II II nil 

SEQIDN08 - ACCGATGTTCACTCAACGCAATATGCATGCGTACACACTCTGTGAATCTA -2044 
SEQIDN07 - CTAGGAAGGAAGGGATCGTGGAAAACGCAGACACTCTAACGAAGTTCCGC -2076 

i iiiiiiiiiiiiii II I III iiiniiiiiiiMiiiiiiiiii 

SEQIDN08 - CGAGGAAGGAAGGGATTGTTGCAAATGCAGACACTCTAACGAAGTTCCGC -2094 
SEQIDN07 - CGTAGCTACATTATCAGTCGTGGTGGTTACATTGGTAACCAGCATTTCGG -2126 

II II II iiiiiiniiiiii iiiiiiiiiii iiiiiimii II 

SEQIDN08 - CGCAGTTATATTATCAGTCGTGGAGGTTACATTGGCAACCAGCATTTTGG -2144 
SEQIDN07 - GGGTATGTGGGTGGGAGACAACTCTACTACATCAAACTACATCCAAATGA -2176 

II llllllll IIIMIIIIIII I I II III IIIIIIIII 

SEQIDN08 - AGGAATGTGGGTTGGAGACAACTCTTCCTCCCAAAGATACCTCCAAATGA -2194 
SEQIDN07 - TGATTGCCAACAATATTAACATGAATATGTCTTGCTTGCCTCTCGTCGGC -2226 

nil II Mil I iiiiim iiiiimi i ii ii ii ii 

SEQIDN08 - TGATCGCGAACATCGTCAACATGAACATGTCTTGCCTTCCACTAGTTGGG -2244 
SEQIDN07 - TCCGACATTGGAGGATTCACCTCATACGACAATGAGAATCAGCGAACGCC -2276 

llllllllllllll II II II II 11 I nil 

SEQIDN08 - TCCGACATTGGAGGTTTTACTTCGTATGA TGGACGAAACGT -2285 

SEQIDN07 - GTGTACCGGGGACTTGATGGTGAGGTATGTGCAGGCGGGCTGCCTGTTGC -2326 

nil IIIIIII I mil II I nninini mi n 

SEQIDN08 - GTGTCCCGGGGATCTAATGGTAAGATTCGTGCAGGCGGGTTGCTTACTAC -2335 
SEQIDN07 - CGTGGTTCAGGAACCACTATGATAGGTGGATCGAGTCCAAGGACCACGGA -2376 

iniiiiiii iMiiiiiii inn i inn im i i m 

SEQIDN08 - CGTGGTTCAGAAACCACTATGGTAGGTTGGTCGAGGGCAAGCAAGAGGGA -2385 
SEQIDN07 - AAGGACTACCAGGAGCTGTACATGTATCCGAATGAAATGGATACGTTGAG -2426 

11 nil II II IIIIIIIIIII I I II MM Ml IIIII 

SEQIDN08 - AAATACTATCAAGAACTGTACATGTACAAGGACGAGATGGCTACATTGAG -2435 
SEQIDN07 - GAAGTTCGTTGAATTCCGTTATCGCTGGCAGGAAGTGTTGTACACGGCCA -2476 



II III Mlllllllllll IIIIMIIIII MMIMIIM 11 I 

SEQIDN08 - AAAATTCATTGAATTCCGTTACCGCTGGCAGGAGGTGTTGTACACTGCTA -2485 
SEQIDN07 - TGTACCAGAATGCGGCTTTCGGAAAGCCGATTATCAAGGCTGCTTCGATG -2526 

llllllllllllllllllllll II llllllllllllll Mill III 

SEQIDN08 - TGTACCAGAATGCGGCTTTCGGGAAACCGATTATCAAGGCAGCTTCCATG -2535 
SEQIDN07 - TACAATAACGACTCAAACGTTCGCAGGGCGCAGAACGATCATTTCCTTCT -2576 

III I mill iiiiiiiiM I Hill ill II iiiiiiii 

SEQIDN08 - TACGACAACGACAGAAACGTTCGCGGCGCACAGGATGACCACTTCCTTCT -2585 
SEQIDN07 - TGGTGGACATGATGGATATCGCATTCTGTGCGCGCCTGTTGTGTGGGAGA -2626 

II IIIM IIIIMIIIII III MM II IIIIIIIIIIIIMII 

SEQIDN08 - CGGCGGACACGATGGATATCGTATTTTGTGTGCACCTGTTGTGTGGGAGA -2635 
SEQIDN07 - ATTCGACCGAACGCGAATTGTACTTGCCCGTGCTGACCCAATGGTACAAA -2676 

II I III IIIM IIIM Mill Ml MM II II I MM MM 

SEQIDN08 - ATACAACCAGTCGCGATCTGTACTTGCCTGTGCTGACCAAATGGTACAAA -2685 
SEQIDN07 - TTCGGTCCCGACTTTGACACCAAGCCTCTGGAAGGAGCGATGAACGGAGG -2726 

Mill M IM I MUM I III I Mill III II I Mill 

SEQIDN08 - TTCGGCCCTGACTATGACACCAAGCGCCTGGATTCTGCGTTGGATGGAGG -2735 
SEQIDN07 - GGACCGAATTTACAACTACCCTGTACCGCAAAGTGAATCACCAATCTTCG -2776 

II III I Mill MM II Mill II II M II II I 

SEQIDN08 - GCAGATGATTAAGAACTATTCTGTGCCACAAAGCGACTCTCCGATATTTG -2785 
SEQIDN07 - TGAGAGAAGGTGCGATTCTCCCTACCCGCTACACGTTGAACGGTGAAAAC -2826 

MM Mill II IIIIIIIIMIIIIIIIIIIIIM Mill III 

SEQIDN08 - TGAGGGAAGGAGCTATTCTCCCTACCCGCTACACGTTGGACGGTTCGAAC -2835 
SEQIDN07 - AAATCATTGAACACGTACACGGACGAAGATCCGTTGGTGTTTGAAGTATT -2876 

II III IIIIIIIIIMM III MM MIIIIIIMIIII Mill 

SEQIDN08 - AAGTCAATGAACACGTACACAGACAAAGACCCGTTGGTGTTTGAGGTATT -2885 
SEQrDN07 - CCCCCTCGGAAACAACCGTGCCGACGGTATGTGTTATCTTGATGATGGCG -2926 

III II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

SEQIDN08 - CCCTCTTGGAAACAACCGTGCCGACGGTATGTGTTATCTTGATGATGGCG -2935 
SEQIDN07 - GTGTGACCACCAATGCTGAAGACAATGGCAAGTTCTCTGTCGTCAAGGTG -2976 

II I II II lllllll IM lllllll IIIIIIII MM II 

SEQIDN08 - GTATTACTACAGATGCTGAGGACCATGGCAAATTCTCTGTTATCAATGTC -2985 
SEQIDN07 - GCAGCGGAGCAGGATGGTGGTACGGAGACGATAACGTTTACGAATGATTG -3026 

I III III MM MM llllll I MM II MM 

SEQIDNOB - GAAGCCTTACGGAAAGGTGTTACGACGACGATCAAGTTTGCGTATGACAC -3035 
SEQrDN07 - CTATGAGTACGTTTTCGGTGGACCGTTCTACGTTCGAGTGCGCGGCGCTC -3076 

III I IIIM II I III II IIIIIMIIIII I M I 

SEQIDNOB - TTATCAATACGTATTTGATGGTCCATTCTACGTTCGAATCCGTAATCTTA -3085 
SEQIDN07 - AGTCGCCGTCGAACATCCACGTGTCTTCTGGAGCGGGTTCTCAGGACATG -3126 

II I II II II IIIIIIIIIIIIIIIIIIM IIIIIIII 

SEQIDNOB - CGACTGCATCAAAAATTAACGTGTCTTCTGGAGCGGGTGAAGAGGACATG -3135 



SEQIDN07 - AAGGTGAGCTCTGCCACTTCCAGGGCTGCGCTGTTCAATGACGGGGAGAA -3176 

I II mill I II mil II mm iii ii i 

SEQIDN08 - ACACCGACCTCTGCGAACTCGAGGGCAGCTTTGTTCAGTGATGGAGGTGT -3185 
SEQIDN07 - CGGTGATTTCTGGGTTGACCAGGAGACAGATTCTCTGTGGCTGAAGTTGC -3226 

II II I Mill im I M II iiiiiiiiii iiiiiim 

SEQIDN08 - TGGAGAATACTGGGCTGACAATGATACGTCTTCTCTGTGGATGAAGTTGC -3235 
SEQIDN07 - CCAACGTTGTTCTCCCGGACGCTGTGATCACAATTACCTAA -3267 

I III I iiiiM iiiiiiiim II mil II 

SEQIDN08 - CAAACCTGGTTCTGCAAGACGCTGTGATTACCATTACGTAG -3276 



Identity : 2550 (78.05%) 

Number of gaps inserted in SEQIDN07: 3 

Number of gaps inserted in SEQIDN08: 2 



==03-MAY-2001 



^NALIGN^ 



* ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 



The two sequences to be aligned are: 
SEQIDN07, 

Total number of bases: 3267. 
SEQIDN09. 

Total number of bases: 3201. 

Open gap cost : 50 

Unit gap cost : 20 



The character to show that two aligned residues are identical is ' |' 
SEQIDN07 - ATGTTTTCAACCCTTGCGTTTGTCGCACCTAGTGCGCTGGGAGCCAGTAC -50 

III I II III II Ml I 

SEQIDN09 - ATGGCAGGATTTTCTGATCCTCTCAACTTTTGCAAAGCAGAAGACTACTA -50 

SEQIDN07 - CTTCGTAGGGGCGGAGGTCAGGTCAAATGTTCGTATCCATTCCGCTTTTC -100 

I II I I II II I III I I 

SEQIDN09 - CAGTGTTGCGCTAGACTGGAAGGGCCCTCAAAAAATCATTGGAGTAGACA -100 

SEQIDN07 - CAGCTGTGCACACAGCTACTCGCAAAACCAATCGCCTCAATGTATCCATG -150 

I II I I II INI I III I II 

SEQIDN09 - CTACTCCTCCAAAGAGCACCAAGTTCCCCAAAAACTGGCATGGAG TG -147 

SEQIDN07 - ACCGCATTGTCCGACAAACAAACGGCTACTGCGGGTAGTACAGACAATCC -200 

II I III I II II II I I II 
SEQIDN09 - AACTTGAGATTCGATGATGGGACTTTAGGTGTGGTTCAGTTCATTAGGCC -197 

SEQIDN07 - GGACGGTATCGACTACAAGACCTACGATTACGTCGGAGTATGGGGTPTCA -250 

1 II I I I M I M II 

SEQIDN09 - GTGCGTTTGGAGGGTTAGATACGACCCTGGTTTCAAGACCTCTGACGAGT -247 
SEQIDN07 - GCCCCCTCTCCAACACGAACTGGTTTGCTGCCGGCTCTTCTACCCCGGGT -300 

II nil III I I I I I 

SEQIDN09 - ATGGTGATGAGAATACGAGGACAATTGTGCAAGATTATATGAGTACTCTG -297 

SEQIDN07 - GGCATCACTGATTGGACGGCTACAATGAATGTCAACTTCGACCGTATCGA -350 

III II II I I I II 

SEQIDN09 - AGTAATAAATTGGATACTTATAGAGGTCTTACGTGGGAAACCAAGTGTGA -347 

SEQIDN07 - CAATCCGTCCATCACTGTCCAGCATCCCGTTCAGGTTCAGGTCACGTCAT -400 

II II I I I I III I I II I III I II 
SEQIDN09 - GGATTCGGG-AGATTTCTTTACCTTCTCATCCAAGGTCACCGCCGTTGAA -396 

SEQIDN07 - ACAACAACAACAGCTACAGGGTTCGCTTCAACCCTGATGGCCCTATTCGT -450 

I M I I I I I II I II I INI 

SEQIDN09 - AAATC—CGAGCGGACCCGCAACAAGGTCGGCGATGGCCTCAGAATTCAC -444 



SEQIDN07 - GATGTGACTCGTGGGCCTATCCTCAAGCAGCAACTAGATTGGATTCGAAC -500 

III I II I I I II I 

SEQIDN09 - CTATGGAAAAGCCCTTTCCGCATCCAAGTAGTGCGCACCTTGACCCCTTT -494 



SEQIDN07 - GCAGGAGCTGTCAGAGGGATGTGATCCCGGAATGACTTTCACATCAGAAG -550 

I INI II II I I I I 

SEQIDN09 - GAAGGATCCTTACCCCATTCCAAATGTAGCCGCAGCCGAAGCCCGTGTGT -544 

SEQIDN07 - GTTTCTTGACTTTTGAGACCAAGGATCTAAGCGTCATCATCTACGGAAAT -600 

II II I II II I III I 

SEQrDN09 - CCGACAAGGTCGTTTGGCAAACGTCTCCCAAGACATTCAGAAAGAACCTG -594 

SEQIDN07 - TTCAAGACCAGAGTTACGAGAAAGTCTGACGGCAAGGTCATCATGGAAAA -650 

I I I Mil I I Mill II 

SEQrDN09 - CATCCGCAACACAAGATGCTAAAGGATACAGTTCTTGACATTGTCAAACC -644 

SEQIDN07 - TGATGAAGTTGGAACTGCATCGTCCGGGAACAAGTGCCGGGGATTGATGT -700 

II I I I II II II I I I I I II 
SEQIDNOg - TGGACATGGCGAGTATGTGGGGTGGGGAGAGATGGGAGGTATCCAGTTTA -694 

SEQrDN07 - TCGTTGATAGATTATACGGTAACGCTATCGCTTCCGTCAACAAGAACTTC -750 

I II i II I III II I II III II I M 

SEQIDN09 - TGAAGGAGCCAACATTCATGAACTATTTTAACTTCGACAATATGCAATAC -744 

SEQIDN07 - CGCAACGACGCGGTCAAGCAGGAGGGATTCTATGGTGCAGGTGAAGTCAA -800 

I III III I I I II II 
SEQrDN09 - CAGCAAGTCTATGCCCAAGGTGCTCTCGATTCTCGCGAG CCA -786 

SEQIDN07 - CTGTAAGTACCAGGACACCTACATCTTAGAACGCACTGGAATCGCCATGA -850 

Mill I I I I I mil I III II I II 

SEQIDN09 - CTGTACCACTCGGATCCCTTCTATCTTGATGTGAACTCCAACCCGGAGCA -836 

SEQIDN07 - CAAATTACAACTACGATAACTTGAACTATAACCAGTGGGACCTTAGACCT -900 

III III I I II I I I I I I I I I I I I I I II 
SEQIDN09 - CAAGA-ATATCACGGCAACCTTTATCGATAACTACTCTCAAATT-GCCAT -884 

SEQIDN07 - CCGCATCATGATGGTGCCCTCAACCCAGACTATTATATTCCAATGTACTA -950 

III II INI I I I I I II 

SEQIDN09 - CGACTTTGGAAAGACCAACTCAGGCTACATCAAGCTGGGAACCAGGTATG -934 
SEQIDN07 - CGCAGCACCTTGGTTGATCGTTAATGGATGCGCCGGTACTTCGGAGCAGT -1000 

I I III I I I I MM I I I II 

SEQIDN09 - GTGGTATCGATTGTTACGGTATCAGTGCGGATACGGTCCCG-GAAATTGT -983 

SEQIDN07 - ACTCGTATGGATGGTTCATGGACAATGTCTCTCAATCTTACATGAATACT -1050 

II I I I III II III I I III 
SEQIDN09 - ACGACTTTATACAGGTCTTGTTGGACGTTCAAAGTTGAAGCCCAGATATA -1033 

SEQIDN07 - GGAGATACTACCTGGAATTCTGGACAAGAGGACCTGGCATACATGGGCGC -1100 

II II III I I III II I I 

SEQIDN09 - TTCTCGGGGCCCATCAAGCCTGTT-ATGGATACCAACAGGAAAGTGACTT -1082 

SEQIDN07 - GCAGTATGGACCATTTGACCAACATTTTGTTTACGGTGCTGGGGGTGGGA -1150 



SEQIDN09 
SEQIDN07 



I I I II I I I I III! I MM 

GTATTCTGTGGTCCAGCAGTACCGTGACTGTAAATTTCCACTTGACGGGA -1 132 



TGGAATGTGTGGTCACAGCGTTCTCTCTTCTACAAGGCAAGGAGTTCGAG 

_.| .._| 1 — 1 1-1- 



-1200 



SEQIDN09 - TTCACGTCGATGTCGATGTTCAGGACGGCTTCAGAACTTTCACCACCAAC -1182 

SEQIDN07 - AACCAAGTTCTCAACAAACGTTCAGTAATGCCTCCGAAATACGTCTTTGG -1250 

II III III II III II III I 
SEQIDNGQ - CCACACACTTTCCCTAACCCCAAAGAGATGTTTACTAACTTGAGGAATAA -1232 

SEQIDN07 - TTTCTTCCAGGGTGTTTTCGGGACTTCTTCCTTGTTGAGAGCGCATATGC -1300 

I II II i I I II I I I M II 

SEQIDN09 - TGGAATCAAGTGCTCCACCAATATCACTCCTGTTATCAGCATTAACAACA -1282 
SEQIDN07 - CAGCAGGTGAGAACAACATCTCAGTCGAAGAAATTGTAGAAGGTTATCAA -1350 

II Mil III I I Mill III II II 

SEQIDN09 - GAGAGGGTGGATACAGTACCCTCCTTGAGGGAGTTGACAAAAAATACTTT -1332 

SEQIDN07 - AACAACAATTTCCCTTTCGAGGGGCTCGCTGTGGACGTGGATATGCAAGA -1400 

I II I I I II I I I 1 I I I 
SEQIDN09 - ATCATGGACGACAGATATACCGAGGGAACAAGTGGGAATGCGAAGGATGT -1382 

SEQIDN07 - CAACTTGCGGGTGTTCACCACGAAGGGCGAATTTTGGACCGCAAACAGGG -1450 

I I II I II I II II 

SEQIDN09 - TCGGTACATGTACTACGGTGGTGGTAATAAGGTTGAGGTCGATCCTAATG -1432 

SEQIDN07 - TGGGTACTGGCGGGGATCCAAACAACCGATCGGTTTTTGAATGGGCACAT -1500 

I II III II II I III 

SEQIDN09 - ATGTTAATGGTCGGCCAGACTTTAAAGACAACTATGACTTCCCCGCGAAC -1482 

SEQrDN07 - GACAAAGGCCTTGTTTGTCAGACAAATATAACTTGCTTCCTGAGGAATGA -1550 

III II II Mill III 

SEQIDN09 - TTCAACAGCAAACAATACCCCTATCATGGTGGTGTGAGCTACGGTTATGG -1532 
SEQrDN07 - TAACGAGGGGCAAGACTACGAGGTCAATCAGACGTTAAGGGAGAGGCAGT -1600 

MM I M I I Ml II MM I 

SEQIDN09 - GAACGGTAGTGCAGGTTTTTACCCGGACCTCAACAGAAAGGAGGTTCGTA -1582 
SEQIDN07 - TGTACACGAAGAACGACTCCCTGACGGGTACGGATTTTGGAATGACCGAC -1650 

II I I II I I III MM II 

SEQIDN09 - TCTGGTGGGGAATGCAGTACAAGTATCTCTTCGATATGGGACTGGAATTT -1632 
SEQIDN07 - GACGGCCCCAGCGATGCGTACATCGGTCATCTGGACTATGGGGGTGGAGT -1700 

I II II Ml III I I III M I II 

SEQIDN09 - GTGTGGCA-AGACATGACTACCCCAGCAATCCACACATCATATGGAGACA -1681 
SEQIDN07 - AGAATGTGATGCACTTTTCCCAGACTGGGGACGGCCTGACGTGGCCGAAT -1750 

MM I III I I II I I III 

SEQIDN09 - TGAAAGGGTTGCCCACCCGTCTACTCGTCACCTCAGACTCCGTCACCAAT -1731 
SEQIDN07 - GGTGGGGAAATAACTATAAGAAACTGTTCAGCATTGGTCTCGACTTCGTC -1800 

I III M I II I III II 

SEQIDN09 - GCCTCTGAGAAAAAGCTCGCAATTGAAACTTGGGCTCTCTACTCCTACAA -1781 



SEQIDN07 - TGGCAAGACATGACTGTTCCAGCAATGATGCCGCACAAAATTGGCGATGA -1850 

I I I I I I I II III III III 

SEQIDN09 - TCTCCACAAAGCAACTTGGCATGGTCTTAGTCGTCTCGAATCTCGTAAGA -1831 



SEQIDN07 - CATCAATGTGAAACCGGATGGGAATTGGCCGAATGCGGACGATCCGTCCA -1900 

I I Ml III I I I II M 

SEQIDN09 - ACAAACGAAACTTCATCCTCGGGCGIGGAAG—TTATGCCGGAGCCTATC -1879 
SEQIDN07 - ATGGACAATACAACTGGAAGACGTACCATCCCCAAGTGCTTGTAACTGAT -1950 

I lllll lllll M 

SEQIDN09 - GTTTTGCTGGTCTCTGGACTGGGGATAATGCAAGTAACTGGGAATTCTGG -1929 
SEQIDN07 - ATGCGTTATGAGAATCATGGTCGGGAACCGATGGTCACTCAACGCAACAT -2000 

I I I I III! II II III! Ill 

SEQIDN09 - AAGATATCGGTCTCTCAAGTTCTTTCTCTGGGCCTCAATGGTGTGTGCAT -1979 

SEQIDN07 - TCATGCGTATACACTGTGCGAGTCTACTAGGAAGGAAGGGATCGTGGAAA -2050 

III I lllll III llllll 

SEQIDN09 - CGCGGGGTCTGATACGGGTGGTTTT GAACCCTACCGTGATGCAAA -2024 

SEQIDN07 - ACGCAGACACTCTAACGAAGTTCCGCCGTAGCTACATTATCAGTCGTGGT -2100 

I I II I I II llllll I lllll I I 

SEQIDN09 - TGGGGTCGAGGAGAAATACTGTAGCCCAGAGCTAC-TCATCAGGTGGTAT -2073 

SEQIDN07 - GGTTACATTGGTAACCAGCATTTCGGGGGTATGTGGGTGGGAGACAACTC -2150 

I I II I II I I II 

SEQIDN09 - ACTGGTTCATTCCTCTTGCCGTGGCTCAGGAACCATTATGTCAAAAAGGA -2123 

SEQIDN07 - TACTACATCAAACTACATCCAAATGATGATTGCCAACAATATTAACATGA -2200 

I I II II II I MM llllll 

SEQIDN09 - CAGGAAATGGTTCCAGGAACCATACTCGTACCCCAAGCATCTTGAAACCC -2173 

SEQIDN07 - ATATGTCTTGCTTGCCTCTCGTCGGCTCCGACATTGGAGGATTCACCTCA -2250 

II I llllll III 

SEQIDN09 - ATCCAGAACTCGCAGACCAAGCATGGCTCTATAAATCCGTTTTGGAGATC -2223 

SEQIDN07 - TACGACAATGAGAATCAGCGAACGCCGTGTACCGGGGACTTGATGGTGAG -2300 

I II III I lllll 

SEQIDN09 - TGTAGGTACTATGTGGAGCTTAGATACTCCCTCATCCAACTACTTTACGA -2273 

SEQIDN07 - GTATGTGCAGGCGGGCTGCCTGTTGCCGTGGTTCAGGAACCACTATGATA -2350 

III I I I I I K I I 

SEQIDN09 - CTGCATGTTTCAAAACGTAGTCGACGGTATGCCAATCACCAGATCTATGC -2323 

SEQIDN07 - GGTGGATCGAGTCCAAGGACCACGGAAAGGACTACCAGGAGCTGTACATG -2400 

I II III I MM I I II II I I I 

SEQIDN09 - TCTTGACCGATACTGAGGATACCACCTTCTTCAACGAGAGCCAAAAGTTC -2373 

SEQIDN07 - TATCCGAATGAA-ATGGATACGTTGAGGAAGTTCGTTGAATTCCGTTATC -2449 

II II II I III II III I II I 

SEQIDN09 - CTCGACAACCAATATATGGCTGGTGACGACATTCTTGTTGCACCCATCCT -2423 



SEQIDN07 



- GCTGGCAGGAAGTGTTGTACACGGCCATGTACCAGAATGCGGCTTTCGGA -2499 



I I I I M III I III I II 

SEQIDN09 - CCACAGTCGCAAAGAAATTCCAGGCGAAAACAGAGATGTCTATCTCCCTC -2473 
SEQIDN07 - AAGCCGATTATCAAGGCTGCTTCGATGTACAATAACGACTCAAACGTTCG -2549 

M — l-^M — l-l-l — 1_|-| — 1-1-1 1 



SEQIDN09 - TTTACCACACCTGGTACCCCTCAAATTTGAGACCATGGGACGATCAAG-G -2522 
SEQIDN07 - CAGGGCGCAGAACGATCATTTCCTTCTTGGTGGACATGATGGATATCGCA -2599 

II I III I II nil I III 

SEQIDN09 - AGTCGCTTTGGGGAATCCTGTCGAAGGTGGTAGTGTCATCAATTATACTG -2572 
SEQIDN07 - TTCTGTGCGCGCCTGTTGTGTGGGAGAATTCGACCGAACGCGAATTGTAC -2649 

I I I I II I III I I III I MM 

SEQIDN09 - CTAGGATTGTTGCACCCGAGGATTATAATCTCTTCCACAGCG—TGGTAC -2620 
SEQIDN07 - TTGCCCGTGCTGACCCAATGGTACAAATTCGGTCCCGACTTTGACACCAA -2699 

II I I I II I I II MM 

SEQIDNG9 - CAGTCTACGTTAGAGAGGGTGCCATCATCCCGCAAATCGAAGTACGCCAA -2670 
SEQIDN07 - GCCTCTGGAAGGAGCGATGAACGGAGGGGACCGAATTTACAACTACCCTG -2749 

MM I I I I I I I I III Mill II I 

SEQIDN09 - TGGACTGGCCAGGGGGGAGC-CAACCGCATCAAGTTCAACATCTACCCTG -2719 
SEQIDN07 - TACCGCAAAGTGAATCACCAATCTTCGTGAGAGAAGGTGCGATTCTCCCT -2799 

I I II III I II I II MM I I I 

SEQIDN09 - GAAAGGATAAGGAGTACTGTACCTATCTTGATGATGGTGTTAGCCGTGAT -2769 

SEQIDN07 - ACCCGCTACACGTTGAACGGTGAAAACAAATCATTGAACACGTACACGGA -2849 

I III II II I III I II I 

SEQIDN09 - AGTGCGCC GGAAGACCTCCCACAGTACAAAGAGACCCA -2807 

SEQIDN07 - CGAAGATCCGTTGGTGTTTGAAGTATTCCCCCTCGGAAACAACCGTGCCG -2899 

Mil I II I I llllll III III I II 

SEQrDN09 - CGAACA GTCGAAGGTTGAAGGCGCGGAAATCGCAAAGCAGATTGGAA -2854 

SEQIDN07 - ACGGTATGTGTTATCTTGATGATGGCGGTGTGACCACCAATGCTGAAGAC -2949 

I I Mill II III I I MM 

SEQIDN09 - AGAAGACGGGTTACAACATCTCAGGA ACCGACCCAGAAGCA -2895 

SEQIDN07 - AATGGCAAGTTCTCTGTCGTCAAGGTGGCAGCGGAGCAGGATGGTGGTAC -2999 

II II I I II II II II I II 
SEQIDN09 - AAGGGTTATCACCGCAAAGTTGCTGTCACACAAACGTCAAAAGACAAGAC -2945 

SEQIDN07 - GGAGACGATAACGTTTACGAATGATTGCTATGAGTACGTTTTCGGTGGAC -3049 

I II I II II I II III MM I 

SEQIDN09 - GCGTACTGTCACTATTGAGCCAAAACACAATGGATACGACCCTTCCAAAG -2995 

SEQIDN07 - CGTTCTACGTTCGAGTGCGCGGCGCTCAGTCGCCGTCGAACATCCACGTG -3099 

II II llllll I III II 

SEQIDN09 - AGGTGGGTGATTATTATACCATCATTCTTTGGTACGCA CCAGGTT -3040 

SEQIDN07 - TCTTCTGGAGCGGGTTCTCAGGACATGAAGGTGAGCTCTGCCACTTCCAG -3149 

II I III M II III II III I I 

SEQIDN09 - TCGATGGCAGCATCGTC GATGTGAGCAAGACGACTGTGAATGTTGA -3086 



SEQIDN07 - GGCTGCGCTGTTCAATGACGGGGAGAACGGTGATTTCTGGGTTGACCAGG -3199 

II II I II I I I I II I I I 

SEQIDN09 - GGGTGGGGTGGAGCACCAAGTTTATAAGAACTCCGATTTACATACGGTTG -3136 



SEQIDN07 - AGACAGATTCTCTGTGGCTGAAGTTGCCCAACGTTGTTCTCCCGGACGCT -3249 

I II II III III II I II 

SEQIDN09 - TTATCGACGTGAAGGAGGTGATCGGTACCACAAAGAGCGTCAAGATCACA -3186 

SEQIDN07 - GTGATCACAATTACCTAA -3267 

II I III 

SEQIDN09 - TG TACTGCCGCTTAA -3201 



Identity : 1108 (34.61%) 

Number of gaps inserted in SEQIDN07: 1 

Number of gaps inserted in SEQIDN09: 21 
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* ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 



The two sequences to be aligned are: 
SEQIDN07. 

Total number of bases: 3267. 
SEQIDNOIO. 

Total number of bases: 3213. 

Open gap cost : 50 

Unit gap cost : 20 

The character to show that two aligned residues are identical is ' |' 

SEQIDN07 - ATGTTTTCAACCCTTGCGTTTGTCGCACCTAGTGCGCTGGGAGCCAGTAC -50 

III I III II II I III I I III 

SEQIDNOIO - ATGGCAGGATTA—TCCGACCCTCTCAATTTCTGCAAAGCAGAGGACTAC -48 

SEQIDN07 - CTTCGTAGGGGCGGAGGTCAGGTCAAATGTTCGTATCCATTCCGCTTTTC -100 

I II III I III I MM I 

SEQIDNOIO - TACGCTGCTGCCAAAGGCTGGAGTGGCCCTCAGAAGATCATTCGCTATGA -98 

SEQIDN07 - CAGCTGTGCACACAGCTACTCGCAAAACCAATCGCCTCAATGTATCCATG -150 

I II I I I I I I I I II I I 
SEQIDNOIO - CCA GACCCCTCCTCAGGGTACAAAAGATCCGAAAAGCTGGCATGC -143 

SEQIDN07 - ACCGCATTGTCCGACAAACAAACGGCTACTGCGGGTAGTACAGACAATCC -200 

till II M II I IIMI I! II 

SEQIDNOIO - GGTAAACCTTCCTTTCGATGACGGGACTATGTGTGTAGTGCAATTCGTCA -193 
SEQIDN07 - GGACGGTATCGACTACAAGACCTACGATTACGTCGGAGTATGGGGTTTCA -250 

II I I I I II II I I II I M 

SEQIDNOIO - GACCCTGT— GTTTGGAGGGTTAGATATGACCCCAGTGTCAAGACTTCTG -241 
SEQIDN07 - GCCCCCTCTCCAACACGAACTGGTTTGCTGCCGGCTCTTCTACCCCGGGT -300 

I II III I I II I I M I I 

SEQIDNOIO - ATGAGTACGGCGATGAGAA TACGAGGACTATTGTACAAGACTACAT -287 

SEQIDN07 - GGCATCACTGATTGGACGGCTACAATGAATGTCAACTTCGACCGTATCGA -350 

II MM Mill II I I I II 

SEQIDNOIO - GACTACTCTGGTTGGAAACTTGGACATTTTCAGAGGTCTTACGTGGGTTT -337 
SEQIDN07 - CAATCCGTCCATCACTGTCCAGCATCCCGTTCAGGTTCAGGTCACGTCAT -400 

II MM I i I M 1 1 I M 

SEQIDNOIO - CTACGTTGGAGGA TTCGGGCGAGTACTACACCTTCAAGTCCGAAGTC -384 



SEQIDN07 - ACAACAACAACAGCTACAGGGTTCGCTTCAACCCTGATGGCCCTATTCGT -450 

III I II I II II I I I II II 

SEQIDNOIO - ACTGCCGTGGACGAAACCGAACGGACTCGAAACAAGGTCGGCGACGGCCT -434 



SEQIDN07 - GATGTGACTCGTGGGCCTATCCTCAAGCAGCAACTAGATTGGATTCGAAC -500 

II I I I I I MM II I I I I 

SEQIDNOIO - CAAGATTTACCTATGGAAAAATCCCTTTCGCATCCAG GTAGTGCGTC -481 



SEQIDN07 - GCAGGAGCTGTCAGAGGGATGTGATCCCGGAATGACTTTCACATCAGAAG -550 

I II I I 1 III MM III I I I I I 

SEQIDNOIO - TCTTGACCCCCCTGGTGGACCCTTTCCCC— ATTCCCAACGTAGCCAATG -529 
SEQIDN07 - GTTTCTTGACTTTTGAGACCAAGGATCTAAGCGTCATCATCTACGGAAAT -600 

I I I Mill I I I I I I 

SEQIDNOIO - CCACAGCCCGTGTGGCCGACAAGGTTGTTTGGCAGACGTCCCCGAAGACG -579 
SEQIDN07 - TTCAAGACCAGAGTTACGAGAAAGTCTGACGGCAAGGTCATCATGGAAAA -650 

MM Mil MM I II 

SEQIDNOIO - TTCAGGAAAAACTTGCATCCGCAGCATAAGATGTTGAAGGATACAGTTCT -629 
SEQIDN07 - TGATGAAGTTGGAACTGCATCGTCCGGGAACAAGTGCCGGGGATTGATGT -700 

MM I M Mill IIMI MM 

SEQIDNOIO - TGATATTATCAAGCCGGGGCACGGAGAGTATGTGGGTTGGGGAGAGATGG -679 
SEQIDN07 - TCGTTGATAGATTATACGGTAACGCTATCGCTTCCGTCAACAAGAACTTC -750 

I M III I Mill 

SEQIDNOIO - GAGGCATCGAGTTTATGAAGGAGCCAACATTCATGAATTATTTCAACTTT -729 

SEQIDN07 - CGCAACGACGCGGTCAAGCAGGAGGGATTCTATGGTGCAGGTGAAGTCAA -800 

III llllll I II II III 

SEQIDNOIO - GACAATATGCAATATCAGCAGGTCTATGCACAAGGCGCTCTTGATAGTCG -779 

SEQIDN07 - CTGTAAGTACCAGGACACCTACATCTTAGAACGCACTGGAATCGCCATGA -850 

II I II I I llllll llllll 
SEQIDNOIO - TGAGCCGTTGTATCACTCTGATCCCTTCTATCTCGACGTGAACTCCAACC -829 

SEQIDN07 - CAAATTACAACTACGATAACTTGAACTATAACCAGTGGGACCTTAGACCT -900 

II I MM MM Mill II II I I 

SEQIDNOIO - CAGAGCACAAGAACATTACGGCAACCTTTATCGATAACTACTCTCAGATT -879 
SEQIDN07 - CCGCATCATGATGGTGCCCTCAACCCAGACTATTATATTCCAATGTACTA -950 

I I Ml MM Ml Ml II I MM I 

SEQIDNOIO - GCCATCGACTTTGGGAAGACCAACTCAGGCTAC-ATCAAGCTGGGTACCA -928 
SEQIDN07 - CGCAGCACCTTGGTTGATCGTTAATGGATGCGCCGGTACTTCGGAGCAGT -1000 

II Llllll MM III M I I III 

SEQIDNOIO - GGTATGGCGGTA-TCGATTGTTACGGTATCAGCGCGGATACGGTCCCGGA -977 
SEQIDN07 - ACTCGTATGGATGGTTCATGGACAATGTCTCTCAATCTTACATGAATACT -1050 

Mill I Mill III I II I MM I 

SEQIDNOIO - GATTGTGCGACTTTATACTGGACT-TGTTGGGCGTTCGAAGTTGAAGCCC -1026 
SEQIDN07 - GGAGATACTACCTGGAATTCTGGACAAGAGGACCTGGCATACATGGGCGC -1100 

I Ml I II I I I II 

SEQIDNOIO - AGGTATATTCTCGGAGCCCACCAAGCTTGTTATGGATACCAGCAGGAAAG -1076 
SEQIDN07 - GCAGTATGGACCATTTGACCAACATTTTGTTTACGGTGCTGGGGGTGGGA -1150 



II I III II II I ill II 

SEQIDNOlO - TGACTTGCATGCTGTTGTTCAGCAGTACCGTGACAC CAAGTTTCCG -1122 

SEQIDN07 - TGGAATGTGTGGTCACAGCGTTCTCTCTTCTACAAGGCAAGGAGTTCGAG -1200 

U4^IJ-I 1 141^1-^^1^^ — 



SEQIDNOlO - CTTGATGGGTTGCATGTCGATGTCGACTTTCAGGACAATTTCAGAACGTT -1172 
SEQIDN07 - AACCAAGTTCTCAACAAACGTTCAGTAATGCCTCCGAAATACGTCTTTGG -1250 

INI III I Mill Mill! Mill I 

SEQIDNOlO - TACCACTAACCCGATTA-CGTTCCCTAATCCCAAAGAAATGTTTACCAA- -1220 
SEQIDN07 - TTTCTTCCAGGGTGTTTTCGGGACTTCTTCCTTGTTGAGAGCGCATATGC -1300 

II I M M I III II III III 

SEQIDNOlO - TCTAAGGAACAATGGAATCAAGTGTTCCACCAACATCACCCCTGTTATCA -1270 
SEQIDN07 - CAGCAGGTGAGAACAACATCTCAGTCGAAGAAATTGTAGAAGGTTATCAA -1350 

Ml I I I I II M III I 

SEQIDNOlO - GTATCAGAGATCGCCCGAATGGGTACAGTACCCTCAATGAGGGATATGAT -1320 

SEQIDN07 - AACAACAATTTCCCTTTCGAGGGGCTCGCTGTGGACGTGGATATGCAAGA -1400 

11 II I I I I I II I I II II I II 

SEQIDNOlO - AAAAAGTACTTCATCATGGATGACAGATATACCGAGGGGACAAGTGGGGA -1370 

SEQIDN07 - CAACTTGCGGGTGTTCACCACGAAGGGCGAATTTTGGACCGCAAACAGGG -1450 

I II II II MM II II 

SEQIDNOlO - CCCGCAAAATGTTCGATACTCTTTTTACGGCGGTGGGAACCCGGTTGAGG -1420 

SEQIDN07 - TGGGTACTGGCGGGGATCCAAACAACCGATCGGTTTTTGAATGGGCACAT -1500 

I II I I I II II I II I 

SEQIDNOlO - TTAACCCTAATGATGTTTGGGCTCGGCCA GACTTTGGAGAC -1461 

SEQIDN07 - GACAAAGGCCTTGTTTGTCAGACAAATATAACTTGCTTCCTGAGGAATGA -1550 

I I II I I I M I II II III 

SEQIDNOlO - AATTATGACTTCCCTACGAACTTCAACTGCAAAGACTACCCCTATCATGG -1511 
SEQIDN07 - TAACGAGGGGCAAGACTACGAGGTCAATCAGACGTTAAGGGAGAGGCAGT -1600 

I I II II M II I I I 

SEQIDNOlO - TGGTGTGAGTTACGGATATGGGAATGGCACTCCAGGTTACTACCCTGACC -1561 
SEQIDN07 - TGTACACGAAGAACGACTCCCTGACGGGTACGGATTTTGGAATGACCGAC -1650 

I III II II I I I MM Mill I Ml I 

SEQIDNOlO - TTAACAGAGAGGAGG— TTCGTATCTGGTGGGGATTGCAGTACGAGTATC -1609 
SEQIDN07 - GACGGCCCCAGCGATGCGTACATCGGTCATCTGGACTATGGGGGTGGAGT -1700 

Ml I I I II 

SEQIDNOlO - TCTTCAATATGGGACTAGAGTTTGTATGGCAAGATATGACAACCCCAGCG -1659 
SEQIDN07 - AGAATGTGATGCACTTTTCCCAGACTGGGGACGGCCTGACGTGGCCGAAT -1750 

I Mil II III I II III I I 

SEQIDNOlO - ATCCATTCATCATATGGAGACATGAAAGGGTTGCCCACCCGTCTGCTCGT -1709 
SEQIDN07 - GGTGGGGAAATAACTATAAGAAACTGTTCAGCATTGGTCTCGACTTCGTC -1800 

I II II II I III I MM 

SEQIDNOlO - CACCGCCGACTCAGTTACCAATGCCTCTGAGAAAAAGCTCGCAATTGAAA -1759 



SEQIDN07 - TGGCAAGACATGACTGTTCCAGCAATGATGCCGCACAAAATTGGCGATGA -1850 

I MM I IN II IN I ~ 

SEQIDNOIO - GTTGGGCTCTTTACTCCTACAACCTCCATAAAGCAACCTTCCACGGTCTT -1809 
SEQIDN07 - CATCAATGTGAAACCGGATGGGAATTGGCCGAATGCGGACGATCCGTCCA -1900 

II III I I II III III! I 

SEQIDNOIO - GGTCGTCTTGAGTCTCGTAAGAACAAACGTAACTTC ATCCTCGGA -1854 

SEQIDN07 - ATGGACAATACAACTGGAAGACGTACCATCCCCAAGTGCTTGTAACTGAT -1950 

III III I II III 

SEQIDNOIO - CGTGGTAGTTACGCCGGTGCCTATCGTTTTGCTGGTCTCTGGACTGGAGA -1904 

SEQIDN07 - ATGCGTTATGAGAATCATGGTCGGGAACCGATGGTCACTCAACGCAACAT -2000 

II I I II MM II III I 

SEQIDNOIO - TAACGCAAGTACGTGGGAATTCTGGAAGATTTCGGTCTCCCAAGTTCTTT -1954 

SEQIDN07 - TCATGCGTATACACTGTGCGAGTCTACTAGGAAGGAAGGGATCGTGGAAA -2050 

I II I I III I II II II I I 

SEQIDNOIO - CTCTAGGTCTCAATGGTGTGTGTATAGCGGGGTCTGATACGGGTGGTTTT -2004 

SEQIDN07 - ACGCAGACACTCTAACGAAGTTCCGCCGTAGCTACATTATCAGTCGTGGT -2100 

II III II II II I I II I Ml Ml 

SEQIDNOIO - GAGCCCGCACG-TACTGAGATTGGGGAGGAGAAATATTGCAGTCCGGAGC -2053 
SEQIDN07 - GGTTACATTGGTAACCAGCATTTCGGGGGTATGTGGGTGGGAGACAACTC -2150 

I Ml I I II II I I 

SEQIDNOIO - TACTCATCAGGTGGTATACTGGATCATTCCTTTTGCCATGGCTTAGAAAC -2103 
SEQIDN07 - TACTACATCAAACTACATCCAAATGATGATTGCCAACAATATTAACATGA -2200 

Mill MM I II III III I 

SEQIDNOIO - CACTACGTCAAGAAGGACAGGAAATGGTTCCAGGAACCATACGCGTACCC -2153 
SEQIDN07 - ATATGTCTTGCTTGCCTCTCGTCGGCTCCGACATTGGAGGATTCACCTCA -2250 

I I II II MM I II I II 

SEQIDNOIO - CAAGCATCTTGAAACCCATCCAGAGCTCGCAGATCAAGCATGGCTTTACA -2203 
SEQIDN07 - TACGACAATGAGAA-TCAGCGAACGCCGTGTACCGGGGACTTGATGGTGA -2299 

I MM I II I I I III I III 

SEQIDNOIO - AATCTGTTCTAGAAATTTGCAGATACTGGGTAGAGCTAAGATATTCCCTC -2253 
SEQIDN07 - GGTATGTGCAGGCGGGCTGCCTGTTGCCGTGGTTCAGGAACCACTATGAT -2349 

II I I I M I MM 

SEQIDNOIO - ATCCAGCTCCTTTACGACTGCATGTTCCAAAACGTGGTCGATGGTATGCC -2303 
SEQIDN07 - AGGTGGATCGAGTCCAAGGACCACGGAAAGGACTACCAGGAGCTGTACAT -2399 

I II I II II I Ml II 

SEQIDNOIO - ACTTGCCAGATCTATGCTCTTGACCGATACTGAGGATACGACCTTCTTCA -2353 

SEQIDN07 - GTATCCGAATGAAATGGATACGTTGAGGAAGTTCGTTGAATTCCGTTATC -2449 

I I II I I I III I II III 

SEQIDNOIO - ATGAGAGCCAAAAGTTCCTCGATAACCAATATATGGCTGGTGACGACATC -2403 



SEQIDN07 - GCTGGCAGGAAGTGTTGTACACGGCCATGTACCAGAATGCGGCTTTCGGA -2499 



II I II I II III III I I M I 

SEQIDNOIO - -CTTGTAGCACCCATCCTCCACAGCCGTAACGAGGTTCCGGGAGAGAACA -2452 
SEQIDN07 - AAGCCGATTATCAAGGCTGCTTCGATGTACAATAACGACTCAAACGTTCG -2549 

LUJ-^LI-IJ l-LUl |_Ul__|.U_Uj_U|_.l 



SEQIDNOIO - GAGATGTCTATCTCCCTCTATTCCACACCTGGTACCC-CTCAAACTTGAG -2501 
SEQIDN07 - CAGGGCGCAGAACGATCATTTCCTTCTTGGTGGACATGATGGATATCGCA -2599 

I I I I I II I I II Ml I I I Ml 

SEQIDNOIO - ACCGTGGGACGATCAGGGAGTCGCTTTAGGGAATCCTGTCGAAGGTGGCA -2551 

SEQIDN07 - TTCTGTGCGCGCCTGTTGTGTGGGAGAATTCGACCGAACGCGAATTGTAC -2649 

II II II I I I II I I 
SEQIDNOIO - GCGTTATCAACTACACTGCCAGGATTGTTGCCCCAGAGGATTATAATCTC -2601 

SEQIDN07 - TTGCCCGTGCTGACCCAATGGTACAAATTCGGTCCCGACTTTGACACCAA -2699 

II I I III INI I III II 

SEQIDNOIO - TTCCACAACGTGGTGCCGGTCTACATCAGAGAGGGTGCCATCATTCCGCA -2651 
SEQIDN07 - GCCTCTGGAAGGAGCGATGAACGGAGGGGACCGAATTTACAACTACCCTG -2749 

II IN I I II II I I I I I II 

SEQIDNOIO - AATTCAGGTACGCCAGTGGATTGGCGAAGGAGGGCCTAATCCCATCAAGT -2701 

SEQIDN07 - TACCGCAAAGTGAATCACCAATCTTCGTGAGAGAAGGTGCGATTCTCCCT -2799 

I I I I I I III 

SEQIDNOIO - TCAATATCTACCCTGGAAAGGACAAGGAGTATGTGACGTACCTTGATGAT -2751 

SEQIDN07 - ACCCGCTACACGTTGAACGGTGAAAACAAATCATTGAACACGTACACGGA -2849 

I I I I I I II III 

SEQIDNOIO - GGTGTTAGCCGCGATAGTGCACCAGATGACCTCCCGCAGTACCGCGAGGC -2801 

SEQIDN07 - CGAAGATCCGTTGGTGTTTGAAGTATTCCCCCTCGGAAACAACCGTGCCG -2899 

I MM III MM III MM III I 

SEQIDNOIO - CTATGAGCAAGCGAAGGTCGAAGGCAAAGACGTCCAGAAGCAACTTGCGG -2851 
SEQIDN07 - ACGGTATGTGTTATCTTGATGATGGCGGTGTGACCACCAATGCTGAAGAC -2949 

I I I II Mill II II III I 

SEQIDNOIO - TCATTCAAGGGAATAAGACTAATGACTTCTCCGCCTCCGGGATTGATAAG -2901 
SEQIDN07 - AATGGCAAGTTCTCTGTCGTCAAGGTGGCAGCGGAGCAGGATGGTGGTAC -2999 

I I III I I I mil I I mil 

SEQIDNOIO - GAGGCAAAGGGTTATCACCGCAAAGTTTCTATCAAACAGGAGTCAAAAGA -2951 
SEQIDN07 - GGAGACGATAACGTTTACGAATGATTGCTATGAGTACGTTTTCGGTGGAC -3049 

MM II MM III II III I II 

SEQIDNOIO - CAAGACCCGTACTGTCACCATTGAGCCAAAACACAACGGATACGACCCCT -3001 
SEQIDN07 - CGTTCTACGTTCGAGTGCGCGGCGCTCAGTCGCCGTCGAACATCCACGTG -3099 

I I Ml I I I I I II I I 

SEQIDNOIO - CTAAGGAAGTTGGTAATTATTATACCATCATTCTTTGGTACGCACCGGGC -3051 
SEQIDN07 - TCTTCTGGAGCGGGTTCTCAGGACATGAAGGTGAGCTCTGCCACTTCCAG -3149 

II II I I I mill II I 

SEQIDNOIO - TTTGACGGCAGCATCGTCGATGTGAGCCAGGCGACCGTGAACATCGAGGG -3101 



SEQIDN07 - GGCTGCGCTGTTCAATGACGGGGAGAACGGTGATTTCTGGGTTGACCAGG -3199 

III Ml Mill III III 

SEQIDNOIO - CGGGGTGGAATGCGAAATTTTCAAGAACACCGGCTT GCATACGGTT -3147 

SEQIDN07 - AGACAGATTCTCTGTGGCTGAAGTTGCCCAACGTTGTTCTCCCGGACGCT -3249 

I I I . I I I I II II I I M 

SEQIDNOIO - GTAGTCAACGTGAAAGAGGTGATCGGTACCACAAAGTCCGTCAAGATCAC -3197 

SEQIDN07 - GTGATCACAATTACCTAA -3267 

II III I I II 
SEQIDNOIO - TTG~CACTACCGCTTAG -3213 

Identity : 1119 (34.83%) 

Number of gaps inserted in SEQIDN07: 1 

Number of gaps inserted in SEQIDNOIO: 21 



^NALIGN= 



* ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 

The two sequences to be aligned are: 
SEQIDN07. 

Total number of bases: 3267. 
SEQIDNOll. 

Total number of bases: 3279. 

Open gap cost : 50 

Unit gap cost : 20 

The character to show that two aligned residues are identical is ' T 
ATGTTTTCAACCCTTGCGTTTGTCGCACCTAGTGCGCTGGGAGCCAGTAC -50 

IIIIM { mil Ml Ml II II IIIMII lllll II 

ATGTTTCCTACCCTGACCTTCATAGCGCCCAGCGCGCTGGCCGCCAGCAC -50 



SEQIDN07 - 
SEQIDNOll - 



SEQIDN07 - CTTCGTAGGGGCGGAGGTCAGGTCAAATGTTCGTATCCATTCCGCTTTTC -100 

III II II Mill II I II MM II II IIMII III 

SEQIDNOll - CTTTGTGGGCGCGGATATCCGATCGGGCATTCGCATTCAATCCGCTCTTC -100 

SEQIDN07 - CAGCTGTGCACACAGCTACTCGCAAAACCAATCGCCTCAATGTATCCATG -150 

Ml llllll III MM lllll IIMIIIIIIIII 

SEQIDNOll - CGGCCGTGCGCAACGCTGTGCGCAGGAGCAAACATTACAATGTATCCATG -150 

SEQIDN07 - ACCGCATTGTCCGACAAACAAACGGCTACTGCGGGTAGTACAGACAATCC -200 

IMIIIIIIII lllll lllll MM I I I llllllll 

SEQIDNOll - ACCGCATTGTCTGACAAGCAAACCGCTATCAGTATTGGCCCTGACAATCC -200 

SEQIDN07 - GGACGGTATCGACTACAAGACCTACGATTACGTCGGAGTATGGGGTTTCA -250 

IMIMIMI lllll I I llllllllll II III III II I 

SEQIDNOll - GGACGGTATCAACTACCAAAACTACGATTACATCCCTGTAGCGGGCTTTA -250 

SEQIDN07 - GCCCCCTCTCCAACACGAACTGGTTTGCTGCCGGCTCTTCTACCCCGGGT -300 

IIMIMIIIMM lllllll IMIIIIIIIIIIII II lllll 

SEQIDNOll - CGCCCCTCTCCAACACCAACTGGTATGCTGCCGGCTCTTCCACTCCGGGC -300 

SEQIDN07 - GGCATCACTGATTGGACGGCTACAATGAATGTCAACTTCGACCGTATCGA -350 

MMIMI II Mill Mill Mlllllllll llllllll II II 

SEQIDNOll - GGCATCACCGACTGGACCGCTACCATGAATGTCAAATTCGACCGCATTGA -350 

SEQIDN07 - CAATCCGTCCATCACTGTCCAGCATCCCGTTCAGGTTCAGGTCACGTCAT -400 

MMIMII I I I lllll llllll lllllllllllll I 

SEQIDNOll - CAATCCGTCGTACTCCAATAACCATCCTGTTCAGATTCAGGTCACGTCGT -400 

SEQIDN07 - ACAACAACAACAGCTACAGGGTTCGCTTCAACCCTGATGGCCCTATTCGT -450 

ilMIIIIIIIIIII MM IIIIIIIIIIIIIIIIIMIM MUM 

SEQIDNOll - ACAACAACAACAGCTTCAGGATTCGCTTCAACCCTGATGGCCCCATTCGT -450 



SEQIDN07 - GATGTGACTCGTGGGCCTATCCTCAAGCAGCAACTAGATTGGATTCGAAC -500 

II II MM M IMIMII M IMIIMI IMIMMIM 

SEQIDNOll - GACGTCTCTCGAGGACCTATCCTGAAACAGCAACTCACTTGGATTCGAAA -500 
SEQIDN07 - GCAGGAGCTGTCAGAGGGATGTGATCCCGGAATGACTTTCACATCAGAAG -550 

lllllllll I llllllll MM MM III I I MM 

SEQIDNOll - CCAGGAGCTGGCGCAGGGATGTAATCCGAACATGAGCTTCTCTCCTGAAG -550 
SEQIDN07 - GTTTCTTGACTTTTGAGACCAAGGATCTAAGCGTCATCATCTACGGAAAT -600 

INI Ml lllllll MM! II nil III II IMIIIM II 

SEQIDNOll - GTTTTTTGTCTTTTGAAACCAAAGACCTAAACGnATAATCTACGGCAAC -600 
SEQIDN07 - TTCAAGACCAGAGTTACGAGAAAGTCTGACGGCAAGGTCATCATGGAAAA -650 

I Mill Hill MM III II Ml I II IMMII II 

SEQIDNOll - TGCAAGATGAGAGTCACGAAGAAG GATGGCTACCTCGTCATGGAGAA -647 

SEQIDN07 - TGATGAAGTTGGAACTGCATCGTCCGGGAACAAGTGCCGGGGATTGATGT -700 

III II I III II II Mill I IIIIIMIII 

SEQIDNOll - TGACGAGTGCAACTCGCAATCAGATGGCAATAAGTGTAGAGGATTGATGT -697 
SEQIDN07 - TCGTTGATAGATTATACGGTAACGCTATCGCTTCCGTCAACAAGAACTTC -750 

IIIIM I IIIMIIIII Mill llllllll I I III II 

SEQIDNOll - ACGTTGACCGGCTATACGGTAATGCTATTGCTTCCGTACAAACGAATTTT -747 
SEQIDN07 - CGCAACGACGCGGTCAAGCAGGAGGGATTCTATGGTGCAGGTGAAGTCAA -800 

I III Ml I II III IIIIIIIIIIIIIMIIIIIIIII 

SEQIDNOll - CACAAAGACACTTCTCGGAACGAGAAATTCTATGGTGCAGGTGAAGTCAA -797 
SEQIDN07 - CTGTAAGTACCAGGAC ACCTACATCTTAGAACGCA -835 

MM II nil II II I llllllll 

SEQIDNOll - CTGTCGCTATGAGGAGCAGGGTAAGGCGCCGACTTATGTTCTAGAACGCT -847 
SEQIDN07 - CTGGAATCGCCATGACAAATTACAACTACGATAACTTGAACTATAACCAG -885 

lllll IIIMIIIII llllllll Mill lllllllllll Mill 

SEQIDNOll - CTGGACTCGCCATGACCAATTACAATTACGACAACTTGAACTACAACCAA -897 
SEQIDN07 - TGGGACCTTAGACCTCCGCATCATGATGGTGCCCTCAACCCAGACTATTA -935 

III I lllll III II I III MM II 

SEQIDNOll - CCAGACGTCGTTCCTCCAGGTTATC CCGACCATCCCAACTACTA -941 

SEQIDN07 - TATTCCAATGTACTACGCAGCACCTTGGTTGATCGTTAATGGATGCGCCG -985 

lllllllllllllllllllllll IIIIM lllll I llllllll I 

SEQIDNOll - CATTCCAATGTACTACGCAGCACCGTGGTTGGTCGTTCAGGGATGCGCGG -991 
SEQIDN07 - GTACTTCGGAGCAGTACTCGTATGGATGGTTCATGGACAATGTCTCTCAA -1035 

I II III MM IMIIIM II Mill lllllllll llllllll 

SEQIDNOll - GGACATCGAAGCAATACTCGTACGGTTGGTTTATGGACAATGTCTCTCAG -1041 
SEQIDN07 - TCTTACATGAATACTGGAGATACTACCTGGAATTCTGGACAAGAGGACCT -1085 

II llllllll lllllllllll I Mill I Mill II MM 

SEQIDNOll - TCGTACATGAACACTGGAGATACGGCGTGGAACTGCGGACAGGAAAACCT -1091 
SEQIDN07 - GGCATACATGGGCGCGCAGTATGGACCATTTGACCAACATTTTGTTTACG -1135 



SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 

SEQIDN07 

SEQIDNOll 



IIIIIMIIIIIIIMII M II Mllllll II II mil II I 

GGCATACATGGGCGCGCAATACGGGCCATTTGATCAGCACTTTGTGTATG -1141 
GTGCTGGGGGTGGGATGGAATGTGTGGTCACAGCGTTCTCTCTTCTACAA - 11 85 

III III I III I III III MM MM II II I MM III 

GTGATGGAGATGGCCTTG AAG ATGTCGTC AAAGCGTTCTCCTTTCTTC AA - 1 1 9 1 
GGCAAGGAGTTCGAGAACCAAGTTCTCAACAAACGTTCAGTAATGCCTCC -1235 

II IIIIIIMIM II II IIIIIIM lllll IIIIIIIIIM 

GGAAAGGAGTTCGAAGACAAAAAACTCAACAAGCGTTCTGTAATGCCTCC -1241 
GAAATACGTCTTTGGTTTCTTCCAGGGTGTTTTCGGGACTTCTTCCTTGT - 1 285 

Ml lllll IMIIIMIIIIIIMMIIIIIMI I III 111 

GAAGTACGTGTTTGGTTTCTTCCAGGGTGTTTTCGGTGCACTTTCACTGT - 1 291 
TGAGAGCGCATATGCCAGCAGGTGAGAACAACATCTCAGTCGAAGAAATT - 1 335 

III I II MM II II IIIIIIIIIIIIMIM MM III 

TGAAGCAGAATCTGCCTGCCGGAGAGAACAACATCTCAGTGCAAGAGATT - 1 341 



GTAGAAGGTTATCAAAACAACAATTTCCCTTTCGAGGGGCTCGCTGTGGA 

M II lllll M I III I I III II II llllllll II II 

GTGGAGGGTTACCAGGATAACGACTACCCCTTTGAAGGGCTCGCGGTAGA 



-1385 



-1391 



CGTGGATATGCAAGACAACTTGCGGGTGTTCACCACGAAGGGCGAATTTT -1435 

M lllllllllll I MM lllll II II II MM II 

TGTTGATATGCAAGATGATCTGCGAGTGTTTACTACCAAACCAGAATATT -1441 
GGACCGCAAACAGGGTGGGTACTGGCGGGGATCCAAACAACCGATCGGTT -1485 

II I Mlllll III II Mill Mill M III MM II 

GGTCGGCAAACATGGTAGGCGAAGGCGGTGATCCTAATAACAGATCAGTC -1491 
TTTGAATGGGCACATGACAAAGGCCTTGTTTGTCAGACAAATATAACTTG -1535 

MIIMIMMIMMMI llllllll llllllll II Mlllll 

TTTGAATGGGCACATGACAGGGGCCTTGTCTGTCAGACGAACGTAACTTG -1541 
CTTCCTGAGGAATGATAACGAGGGGCAAGACTACGAGGTCAATCAGACGT -1585 

MM Mlllll IIMII III M lllll II IIIIIMI I 

CTTCTTGAGGAACGATAACAGTGGGAAACCATACGAAGTGAATCAGACAT -1591 
TAAGGGAGAGGCAGTTGTACACGAAGAACGACTCCCTGACGGGTACGGAT -1635 

I Mlllll llllllll llllllll II III III II III 

TGAGGGAGAAACAGTTGTATACGAAGAATGATTCCTTGAACAACACCGAT -1641 
TTTGGAATGACCGACGACGGCCCCAGCGATGCGTACATCGGTCATCTGGA -1685 

Mlllll III M II II Mlllllllllll II III MM 

TTTGGAACTACCTCGGATGGGCCTGGCGATGCGTACATTGGACATTTGGA -1691 
CTATGGGGGTGGAGTAGAATGTGATGCACTTTTCCCAGACTGGGGACGGC -1735 

IMMI IIIIMII II IIMMIII I llllllllllllll II I 

CTATGGTGGTGGAGTGGAGTGTGATGCAATCTTCCCAGACTGGGGTCGAC -1 741 
CTGACGTGGCCGAATGGTGGGGAAATAACTATAAGAAACTGTTCAGCATT -1785 

I IMIIIII IIIIIIIIIM I lllll lllll IIIMIIIIIII 

CAGACGTGGCTCAATGGTGGGGAGAAAACTACAAGAAGCTGTTCAGCATT -1791 



SEQIDN07 - GGTCTCGACTTCGTCTGGCAAGACATGACTGTTCCAGCAATGATGCCGCA -1835 

IIIMIII Mill lllll M Mill II II II lllllllllll 

SEQIDNOll - GGTCTCGATTTCGTGTGGCAGGATATGACGGTACCTGCGATGATGCCGCA -1841 
SEQIDN07 - CAAAATTGGCGATGACATCAATGTGAAACCGGATGG GAATT -1876 

I II II MM MM II Ml II I I 

SEQIDNOll - CCGACTCGGTGATGCTGTCAACAAAAATTCCGGTAGTTCGGCGCCGGGCT -1891 
SEQIDN07 - GGCCGAATGCGGACGATCCGTCCAATGGACAATACAACTGGAAGACGTAC -1926 

IMMMII I IIMIII Mill MM IIIIMIIIIII I II 

SEQIDNOll - GGCCGAATGAGAACGATCCATCCAACGGACGATACAACTGGAAATCTTAT -1941 
SEQIDN07 - CATCCCCAAGTGCTTGTAACTGATATGCGTTATGAGA ATCATGGTCG -1973 

lllll MIMIII II II II lllll MM I MM I 

SEQIDNOll - CATCCGCAAGTGCTCGTGACCGACATGCGCTATGGTGCAGAGTATGGAAG -1991 
SEQIDN07 - GGAACCGATGGTCACTCAACGCAACATTCATGCGTATACACTGTGCGAGT -2023 

IIIMIIIIIII IIIIIIMIIIIMII II M II II II II I 

SEQIDNOll - GGAACCGATGGTGTCTCAACGCAACATTCACGCCTACACTCTTTGTGAAT -2041 
SEQIDN07 - CTACTAGGAAGGAAGGGATCGTGGAAAACGCAGACACTCTAACGAAGTTC -2073 

MM II Mi II II MM IIIIIMIMI I I II llllll 

SEQIDNOll - CTACCAGACGGGAGGGAATTGTGGGAAACGCAGACAGTTTGACCAAGTTC -2091 
SEQIDN07 - CGCCGTAGCTACATTATCAGTCGTGGTGGTTACATTGGTAACCAGCATTT -2123 

lllll II Mill IIIIIMI II MIMMI MIIIIIIMIIII 

SEQIDNOll - CGCCGCAGTTACATCATCAGTCGAGGAGGTTACATCGGTAACCAGCATTT -2141 
SEQIDN07 - CGGGGGTATGTGGGTGGGAGACAACTCTACTACATCAAACTACATCCAAA -2173 

Ml II IIIMIII II llllll I I III I MM MUM 

SEQIDNOll - CGGAGGGATGTGGGTTGGGGACAACAGTGCCACAGAATCCTACCTCCAAA -2191 
SEQIDN07 - TGATGATTGCCAACAATATTAACATGAATATGTCTTGCTTGCCTCTCGTC -2223 

Mill I II MM III IIMMIIIIIIM III I M II M 

SEQIDNOll - TGATGTTGGCGAACATTATCAACATGAATATGTCGTGCCTCCCGCTAGTT -2241 
SEQIDN07 - GGCTCCGACATTGGAGGATTCACCTCATACGACAATGAGAATCAGCGAAC -2273 

Mill II lllll II llllll Ml I MM II III 

SEQIDNOll - GGCTCTGATATTGGCGGGTTCACCCAGTACAATGATGCGGGCGACCCAAC -2291 
SEQIDN07 - GCCGTGTACCGGGGACTTGATGGTGAGGTATGTGCAGGCGGGCTGCCTGT -2323 

M II Ml MIMIII II I IIIIIMI IIIM III 

SEQIDNOll - CCC CGAGGATTTGATGGTAAGATTCGTGCAGGCTGGCTGTCTGC -2335 

SEQIDN07 - TGCCGTGGTTCAGGAACCACTATGATAGGTGGATCGAGTCCAAGGACCAC -2373 

I IIIMIIIIII lllllllllll IIIMIII MIIIIMI I III 

SEQIDNOll - TACCGTGGTTCAGAAACCACTATGACAGGTGGATTGAGTCCAAGAAGCAC -2385 
SEQIDN07 - GGAAAGGACTACCAGGAGCTGTACATGTATCCGAATGAAATGGATACGTT -2423 

II Ml I MIIIIMI I MIMIII Ml III IIIMIIII 

SEQIDNOll - GGGAAGAAATACCAGGAGTTATACATGTACCCGGGGCAAAAGGATACGTT -2435 
SEQIDN07 - GAGGAAGTTCGTTGAATTCCGTTATCGCTGGCAGGAAGTGTTGTACACGG -2473 



SEQIDNOll 



II IIIIIIIIIIIMMIII M lllllllllll II llllllll I 

GAAGAAGTTCGTTGAATTCCGCTACCGCTGGCAGGAGGTTTTGTACACAG 



-2485 



SEQIDN07 - CCATGTACCAGAATGCGGCTTTCGGAAAGCCGATTATCAAGGCTGCTTCG -2523 

III.IJII.IJI Mill .1 .. ILLJIJ 11.11 IJIIIIII II L 

SEQIDNOll - CCATGTACCAAAATGCTACCACTGGAGAGCCGATCATCAAGGCGGCGCCC -2535 

SEQIDN07 - ATGTACAATAACGACTCAAACGTTCGCAGGGCGCAGAACGATCATTTCCT -2573 

llllllll mill Hill I iiiiiii {iiiiiiiiii 

SEQIDNOll - ATGTACAACAACGACGTCAACGTGTATAAATCGCAGAATGATCATTTCCT -2585 

SEQIDN07 - TCTTGGTGGACATGATGGATATCGCATTCTGTGCGCGCCTGTTGTGTGGG -2623 

III lllllllllll II Mill Mill Mill lllllllll I I 

SEQIDNOll - TCTCGGTGGACATGACGGCTATCGTATTCTCTGCGCACCTGTTGTGCGCG -2635 

SEQIDN07 - AGAATTCGACCGAACGCGAATTGTACTTGCCCGTGCTGACCCAATGGTAC -2673 

I III MM llllll Mill MM III III IIIU 

SEQIDNOll - AAAATGCGACAAGTCGCGAAGTGTACCTGCCTGTGTATAGCAAGTGGTTC -2685 

SEQIDN07 - AAATTCGGTCCCGACTTTGACACCAAGCCTCTGGAAGGAGCGATGAACGG -2723 

llllllll II lllllllllll Mill Mill I III I ir 

SEQIDNOll - AAATTCGGACCGGACTTTGACACTAAGCCCTTGGAAAATGAGATTCAAGG -2735 

SEQIDN07 - AGGGGACCGAATTTACAACTACCCTGTACCGCAAAGTGAATCACCAATCT -2773 

III I MM II III III Mill I II II II II I 

SEQIDNOll - AGGTCAGACGCTTTATAATTACGCTGCACCGCTGAACGATTCGCCGATAT -2785 

SEQIDN07 - TCGTGAGAGAAGGTGCGATTCTCCCTACCCGCTACACGTTGAACGGTGAA -2823 

I Mill Mill I Mill II II II llllll II Mini 

SEQIDNOll - TTGTGAGGGAAGGGACTATTCTTCCGACACGGTACACGCTGGACGGTGTG -2835 

SEQIDN07 - AACAAATCATTGAACACGTACACGGACGAAGATCCGTTGGTGTTTGAAGT -2873 

llllllll I lllllllllll III I llllll I II Mill I 

SEQIDNOll - AACAAATCTATCAACACGTACACAGACAATGATCCGCTTGTATTTGAGCT -2885 

SEQIDN07 - ATTCCCCCTCGGAAACAACCGTGCCGACGGTATGTGTTATCTTGATGATG -2923 

Mill MM llllllll M I II III MM llllllll 

SEQIDNOll - GTTCCCTCTCGAAAACAACCAGGCGCATGGCTTGTTCTATCATGATGATG -2935 

SEQIDN07 - GCGGTGTGACCACCAATGCTGAAGACAATGGCAAGTTCTCTGTCGTCAAG -2973 

IIIIIII llllllll lllllllll IIIMIII Mill III 

SEQIDNOll - GCGGTGTCACCACCAACGCTGAAGACTTTGGCAAGTATTCTGTGATCAGT -2985 

SEQIDN07 - GTGGCAGCGGAGCAGGATGGTGGTACGGAGACGATAACGTTTACGAATGA -3023 

Ml II I llllll III I II I I MM Mill 

SEQIDNOll - GTGAAGGCCGCGCAGGAAGGTTCTCAAATGAGTGTCAAGTTTGACAATGA -3035 

SEQIDN07 - TTGCTATGAGTACGTTTTCGGTGGACCGTTCTACGTTCGAGTGCGCGGCG ~3073 

Mill II I II II IIIIIII Mill II II 

SEQIDNOll - AGTTTATGAACACCAATGGGGAGCATCGTTCTATGTTCGTGTTCGTAATA -3085 

SEQIDN07 - CTCAGTCGCCGTCGAACATCCACGTGTCTTCTGGAGCGGGTTCTCAGGAC -3123 

I Mill IMMI MM MUM III llllll 

SEQIDNOll - TGGGTGCTCCGTCTAACATCAACGTATCTTCTCAGATTGGTCAACAGGAC -3135 



SEQIDN07 - ATGAAGGTGAGCTCTGCCACTTCCAGGGCTGCGCTGTTCAATGACGGGGA -3173 

III I Mill! I I lllllllll Mllll I I I 

SEQIDNOll - ATGCAACAGAGCTCCGTGAGTTCCAGGGCGCAAATGTTCACTAGTGCTAA -3185 

SEQIDN07 - GAACGGTGATTTCTGGGTTGACCAGGAGACAGATTCTCTGTGGCTGAAGT -3223 

I I I I I II II I I I I I I I I I I II III I I I I I I I I II I 
SEQIDNOll - CGATGGCGAGTACTGGGTTGACCAGAGCACGAACTCGTTGTGGCTCAAGT -3235 

SEQIDN07 - TGCCCAACGTTGTTCTCCCGGACGCTGTGATCACAATTACCTAA -3267 

INI I III Hi lllllll Mllll II I I 

SEQIDNOll - TGCCTGGTGCAGTTATCCAAGACGCTGCGATCACTGTTCGTTGA -3279 



Identity : 2450 (74.99%) 

Number of gaps inserted in SEQIDN07: 3 

Number of gaps inserted in SEQIDNOll: 3 



===03-MAY-2001==================NALIGN===========================PC/GENE=== 



* ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 

The two sequences to be aligned are: 
SEQIDN07. 

Total number of bases: 3267. 
SEQIDN0I2. 

Total number of bases: 1712, 

Open gap cost : 50 

Unit gap cost : 20 

The character to show that two aligned residues are identical is ' |' 

SEQIDN07 - ATGTTTTCAACCCTTGCGTTTGTCGCACCTAGTGCGCTGGGAGCCAGTAC -50 
III 

SEQIDN012 - ATG 3 

SEQIDN07 - CTTCGTAGGGGCGGAGGTCAGGTCAAATGTTCGTATCCATTCCGCTTTTC -100 



SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 



CAGCTGTGCACACAGCTACTCGCAAAACCAATCGCCTCAATGTATCC ATG - 1 50 
3 

ACCGCATTGTCCGACAAACAAACGGCTACTGCGGGTAGTACAGACAATCC -200 
3 



SEQIDN07 - GGACGGTATCGACTACAAGACCTACGATTACGTCGGAGTATGGGGTTTCA -250 



SEQIDN012 



SEQIDN07 - GCCCCCTCTCCAACACGAACTGGTTTGCTGCCGGCTCTTCTACCCCGGGT -300 



SEQIDN012 

SEQIDN07 

SEQIDN012 



GGCATCACTGATTGGACGGCTACAATGAATGTCAACTTCGACCGTATCGA -350 
3 



SEQIDN07 - CAATCCGTCCATCACTGTCCAGCATCCCGTTCAGGTTCAGGTCACGTCAT -400 
SEQIDN012 3 



SEQIDN07 - ACAACAACAACAGCTACAGGGTTCGCTTCAACCCTGATGGCCCTATTCGT -450 



SEQIDN012 



-3 



SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 



GATGTGACTCGTGGGCCTATCCTCAAGCAGCAACTAGATTGGATTCGAAC -500 



GCAGGAGCTGTCAGAGGGATGTGATCCCGGAATGACTTTCACATCAGAAG -550 



-3 



- GTTTCTTGACTTTTGAGACCAAGGATCTAAGCGTCATCATCTACGGAAAT -600 



TTCAAGACCAGAGTTACGAGAAAGTCTGACGGCAAGGTCATCATGGAAAA -650 



TGATGAAGTTGGAACTGCATCGTGCGGGAACAAGTGCCGGGGATTGATGT -700 



-3 



TCGTTGATAGATTATACGGTAACGCTATCGCTTCCGTCAACAAGAACTTC -750 



- CGCAACGACGCGGTCAAGCAGGAGGGATTCTATGGTGCAGGTGAAGTCAA -800 



CTGTAAGTACCAGGACACCTACATCTTAGAACGCACTGGAATCGCCATGA -850 



-A -4 



CAAATTACAACTACGATAACTTGAACTATAACCAGTGGGACCTTAGACCT -900 

nil II II II II II 1 1 mil I II II iiiiiM II 

CAAACTATAATTATGACAATTTGAACTACAATCAACCGGACCTCATCCCA -54 

CCGCATCATGATGGTGCCCTCAACCCAGACTATTATATTCCAATGTACTA -950 

II mill I I II mil iiiiiiii IIIIIM 

CCTGGCCATGATTCAG ATCCTGACTACTATATTCCGATGTACTT -98 

CGCAGCACCTTGGTTGATCGTTAATGGATGCGCCGGTACTTCGGAGCAGT - 1 000 

II mil III mm mm ii ii ii mi 

TGCGGCACCATGGGTGATCGCACATGGATATCGTGGCACCAGCGACCAGT - 1 48 

ACTCGTATGGATGGTTCATGGACAATGTCTCTCAATCTTACATGAATACT - 1 0 50 

MM lllllllllll llllllllll II M II MM II III 

ACTCTTATGGATGGTTTTTGGACAATGTATCCCAGTCCTACACAAACACT -198 

GGAGATACTACCTGGAATTCTGGACAAGAGGACCTGGCATACATGGGCGC -1 1 00 

II Ml I I III MM II MM Ml! Mllllll M 

GGCGATGATGCATGG GCTGGTCAGAAGGATTTGGCGTACATGGGGGC -245 

GCAGTATGGACCATTTGACCAACATTTTGTTTACGGTGCTGGGGGTGGGA -1 1 50 



SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 
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SEQIDN07 
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SEQIDN012 

SEQIDN07 
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SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 

SEQIDN07 

SEQIDN012 



II I III II II M IIIIIIIMII III Mill I III 

ACAATGTGGGCCTTTCGATCAACATTTTGTGTATGAGGCTGGAGATGGAC -295 
TGGAATGTGTGGTCACAGCGTTCTCTCTTCTACAAGGCAAGGAGTTCGAG -1200 

I. Ill II M INI llllll I \ IIIIIIIMII I III 

TTGAAGACGTTGTGACCGCATTCTCTTATTTGCAAGGCAAGGAATATGAG -345 
AACCAAGTTCTCAACAAACGTTCAGTAATGCCTCCGAAATACGTCTTTGG -1 250 

Mill I II III MUM I IIMIIIIIIII Mill II II 

AACCAGGGACTGAATATACGTTCTGCAATGCCTCCGAAGTACGTTTTCGG -395 
TTTCTTCCAGGGTGTTTTCGGGACTTCTTCCTTGTTGAGAGCGCATATGC -1 300 

II Mill M II Mill I I II II Ml I III 

ATTTTTCCAAGGCGTATTCGGAGCCACATCGCTGCTAAGGGACAACTTAC -445 
CAGCAGGTGAGAACAACATCTCAGTCGAAGAAATTGTAGAAGGTTATCAA -1 350 

I II II IMIIIIII MM I MIMIIIIII Mill IIMII 

CTGCCGGCGAGAACAACGTCTCTTTGGAAGAAATTGTTGAAGGATATCAA -495 
AACAACAATTTCCCTTTCGAGGGGCTCGCTGTGGACGTGGATATGCAAGA -1 400 

II III I II II II M II IIMIIII II IIIIIIIMII 

AATCAGAACGTGCCATTTGAAGGTCTTGCTGTGGATGTTGATATGCAAGA -545 
CAACTTGCGGGTGTTCACCACGAAGGGCGAATTTTGGACCGCAAACAGGG -1450 

Mill I llllllll MM I MMIIII IIMIII II 

TGACTTGAGAGTGTTCACTACGAGACCAGCGTTTTGGACGGCAAACAAGG -595 
TGGGTACTGGCGGGGATCCAAACAACCGATCGGTTTTTGAATGGGCACAT -1 500 

MM Mill llllllllllll II II Mill IIMIIIII 

TGGGGGAAGGCGGTGATCCAAACAACAAGTCAGTGTTTGAGTGGGCACAT -645 
GACAAAGGCCTTGTTTGTCAGACAAATATAACTTGCTTCCTGAGGAATGA -1550 

MM MMIIII II Mill III MIMIIIIII III III II 

GACAGGGGCCTTGTCTGCCAGACGAATGTAACTTGCTTCTTGAAGAACGA -695 
TAACGAGGGGCAAGACTACGAGGTCAATCAGACGTTAAGGGAGAGGCAGT -1600 

II I Mill II Mini I II IMIMI Mill 

GAAAAA TCCTTACGAAGTGAATCAGTCATTGAGGGAGAAGCAGT -739 

TGTACACGAAGAACGACTCCCTGACGGGTACGGATTTTGGAATGACCGAC - 1 650 

MM MIIIM II III II I MIMIIIII II 

TGTATACGAAGAGTGATTCCTTGGACAACATTGATTTTGGAACTACTCCA -789 
GACGGCCCCAGCGATGCGTACATCGGTCATCTGGACTATGGGGGTGGAGT -1700 

II M II llllllllllllll II II I Mill M Mill II 

GATGGGCCTAGCGATGCGTACATTGGACACTTAGACTACGGTGGTGGTGT -839 
AGAATGTGATGCACTTTTCCCAGACTGGGGACGGCCTGACGTGGCCGAAT -1750 

II IMIIIIIIM llllllllllllil II II llllllll Ml 

GGAGTGTGATGCACTATTCCCAGACTGGGGTCGACCAGACGTGGCTCAAT -889 
GGTGGGGAAATAACTATAAGAAACTGTTCAGCATTGGTCTCGACTTCGTC -1800 

MIIIM IIMIII llllllll IIIIIIIIIIIIIMII MUM 

GGTGGGGCGATAACTACAAGAAACTATTCAGCATTGGTCTCGATTTCGTC -939 



# 



SEQIDN07 

SEQIDN012 

SEQIDN07 
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SEQIDN07 
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SEQIDN07 

SEQIDN012 

SEQIDN07 
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TGGCAAGACATGACTGTTCCAGCAATGATGCCGCACAAAATTGGCGATGA 

llllllll IIMI II II M IIIIMIIIIM I I Mil 

TGGCAAGATATGACGGTACCTGCGATGATGCCGCACCGACTCGGTGACCC 



-1850 



-989 



- CATCAATGTGAAACCGGATGGGAAT TGGCCGAATGCGGACG -1891 

II II II II II llllllllll I I 

- TGTCGGCACAAATTCCGGTGAGACGGCGCCGGGCTGGCCGAATGATAAGG -1039 

- ATCCGTCCAATGGACAATACAACTGGAAGACGTACCATCCCCAAGTGCTT -1941 

MM mil MM llllll IMIII I IMIIIII llllllll 

- ATCCATCCAACGGACGATACAATTGGAAGTCTTACCATCCGCAAGTGCTC -1089 

- GTAACTGATATGCGTTATGAGAATCATGGTCGGGAACCGATGGTCACTCA -1991 

II IIMI III I Mill II I II I II II II M II II 

- GTGACTGACATGAGGTATGACGATTACGGAAGAGATCCCATTGTTACGCA -1139 

- ACGCAACATTCATGCGTATACACTGTGCGAGTCTACTAGGAAGGAAGGGA -2041 

MUM I Mill II II II II IIIIIMIIIIII llllll I 

- ACGCAATCTCCATGCCTACACTCTTTGTGAGTCTACTAGGAGGGAAGGCA -1189 

- TCGTGGAAAACGCAGACACTCTAACGAAGTTCCGCCGTAGCTACATTATC -2091 

I II 1 lllllllll I III IIIIIIIIIIMII Mill llllll 

- TTGTTGGAAACGCAGATAGTCTGACGAAGTTCCGCCGCAGCTATATTATC -1239 

- AGTCGTGGTGGTTACATTGGTAACCAGCATTTCGGGGGTATGTGGGTGGG -2141 

IMIIMI II IIMI Mill Mill II II II llllllll II 

- AGTCGTGGAGGCTACATCGGTAATCAGCACTTTGGTGGGATGTGGGTAGG -1289 

- AGACAACTCTACTACATCAAACTACATCCAAATGATGATTGCCAACAATA -2191 

llllllllll MM llllllll 1 1 III 1 1 II MM II 

- AGACAACTCTTCTACGGAAGACTACCTCGCAATGATGGTTATCAACGTTA -1339 

- TTAACATGAATATGTCTTGCTTGCCTCTCGTCGGCTCCGACATTGGAGGA -2241 

I llllllll Mill I I II IIMI II Mill llllllll 

- TCAACATGAACATGTCCGGTGTCCCGCTCGTTGGTTCCGATATTGGAGGT -1389 

- TTCACCTCATACGACAATGAGAATCAGCGAACGCCGTGTACCGGGGACTT -2291 

Mill I Mill I III II II M llllll 

- TTCACGGAGCATGACAA GAGAAACCCTTGCACACCGGACTT -1430 

- GATGGTGAGGTATGTGCAGGCGGGCTGCCTGTTGCCGTGGTTCAGGAACC -2341 

MM MM I IIIIMIII II Ml II I IIIIIMIIIIMIII 

- GATGATGAGATTTGTGCAGGCTGGATGCTTGCTACCGTGGTTCAGGAACC -1480 

- ACTATGATAGGTGGATCGAGTCCAAGGACCACGGAAAGGACTACCAGGAG -2391 

MM Mlllllllllllll MM I IIMIIIII llllill Ml 

- ACTACGATAGGTGGATCGAGAGCAAGAAACACGGAAAGAACTACCAAGAG -1530 

- CTGTACATGTATCCGAATGAAATGGATACGTTGAGGAAGTTCGTTGAATT -2441 

IIMMIIII I 11 MM llllill llllllll 

- TTGTACATGTACCGCGACCACTTGGACGCCTTGAGAAGTTTTGTGGAACT -1580 

- CCGTTATCGCTGGCAGGAAGTGTTGTACACGGCCATGTACCAGAATGCGG -2491 



Ill lllllllllllillllllll Mill IIIMIII IIIIMII 

SEQIDN012 - CCGCTATCGCTGGCAGGAAGTGTTATACACAGCCATGTATCAGAATGCTT -1630 



SEQIDN07 - CTTTCGGAAAGCCGATTATCAAGGCTGCTTCGATGTACAATAACGACTCA -2541 

III IIIIMII Mill II .11 Mill I M Mill 

SEQrDN012 - TGAACGGGAAGCCGATCATCAAAACGGTCTCCATGTACAACAACGATATG -1680 
SEQIDN07 - AACGTTCGCAGGGCGCAGAACGATCATTTCCTTCTTGGTGGACATGATGG -2591 

Mill II HIM II M Mill 

SEQIDN012 - AACGTCAAAGATGCTCAGAATGACCACTTCCT -1712 



SEQIDN07 - ATATCGCATTCTGTGCGCGCCTGTTGTGTGGGAGAATTCGACCGAACGCG -2641 



SEQIDN07 - AATTGTACTTGCCCGTGCTGACCCAATGGTACAAATTCGGTCCCGACTTT -2691 



SEQIDN07 - GACACCAAGCCTCTGGAAGGAGCGATGAACGGAGGGGACCGAATTTACAA -2741 



SEQIDN07 - CTACCCTGTACCGCAAAGTGAATCACCAATCTTCGTGAGAGAAGGTGCGA -2791 



SEQIDN07 - TTCTCCCTACCCGCTACACGTTGAACGGTGAAAACAAATCATTGAACACG -2841 
SEQIDN07 - TACACGGACGAAGATCCGTTGGTGTTTGAAGTATTCCCCCTCGGAAACAA -2891 
SEQIDN07 - CCGTGCCGACGGTATGTGTTATCTTGATGATGGCGGTGTGACCACCAATG -2941 



SEQIDN07 - CTGAAGACAATGGCAAGTTCTCTGTCGTCAAGGTGGCAGCGGAGCAGGAT -2991 
SEQIDN07 - GGTGGTACGGAGACGATAACGTTTACGAATGATTGCTATGAGTACGTTTT -3041 



SEQIDN07 - CGGTGGACCGTTCTACGTTCGAGTGCGCGGCGCTCAGTCGCCGTCGAACA -3091 



SEQIDN07 - TCCACGTGTCTTCTGGAGCGGGTTCTCAGGACATGAAGGTGAGCTCTGCC -3141 



SEQIDN07 - ACTTCCAGGGCTGCGCTGTTCAATGACGGGGAGAACGGTGATTTCTGGGT -3191 



SEQIDN07 - TGACCAGGAGACAGATTCTCTGTGGCTGAAGTTGCCCAACGTTGTTCTCC -3241 



SEQIDN07 - CGGACGCTGTGATCACAATTACCTAA -3267 



Identity : 1272 (74.30%) 

Number of gaps inserted in SEQIDN07: 1 

Number of gaps inserted in SEQIDN012: 5 
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